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ABSTRACT 


During  the  last  three  decades,  numerous  authors 

have  attempted  to  simulate  the  process  through  which 

clinicians  make  diagnoses.  Limited  originally  to  simple 
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mechanical  models  '  ,  the  advent  of  electronic  computers 
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created  new  technologies  .  With  the  introduction  of 
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Boolean  Algebra  ,  it  became  possible  to  apply  Bayes  Law 
(a  formula  which  allows  predictions  of  disease  probabil¬ 
ities  from  incidence  data  and  the  occurrence  of  symptoms 
within  diseases)  to  clinical  problems. 

Studies  in  the  radiographic  diagnosis  of  primary 
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bone  tumors  ,  the  acute  abdomen  ,  thyroid  disease  and 

g 

Cushing's  syndrome  have  demonstrated  the  value  of 

9 

Bayesian  models.  Warner  et  al.  studied  congenital  heart 
disease  in  a  population  ranging  from  one  month  to  greater 
than  twenty  years.  Their  work  omitted  the  neonatal 
population,  the  age  during  which  the  most  serious  cardiac 
malformations  manifest.  This  study  applies  the  technique 
of  Bayesian  analysis  to  a  neonatal  population.  The  model 
involves  identification  of  the  important  symptoms  (historical, 
physical  findings,  EKG,  roentgenographic  findings)  and 
diseases  followed  by  completion  of  a  symptom-disease  matrix 
(the  incidence  of  each  disease  as  well  as  the  incidence  of 
each  symptom  within  each  disease) .  Using  a  computer  to 
tabulate  the  enormous  quantities  of  data,  the  program's 
accuracy  ranges  from  fifty-two  to  sixty  percent  in  different 


populations,  compared  to  an  approximate  clinical  accuracy 

of  eighty  percent.  With  variations  in  the  symptom- 

disease  matrix  to  correct  errors  in  the  data,  the  accuracy 

of  the  computer  program  can  be  increased  to  seventy-five 

percent.  In  addition,  the  computer  compiles  a  differential 

which  not  only  resembles  the  clinical  differential  but  at 

times  is  considered  more  complete  or  logical. 

With  improvements  in  the  data  base,  it  is  expected 

that  the  program's  accuracy  can  be  greatly  increased, 

and  could  become  an  effective  adjunct  to  clinical  diagnosis. 

Other  studies  have  demonstrated  the  utility  of  Bayesian 

models  as  clinical^  and  teaching  aids1'*'.  The  possibilities, 

12  13 

including  simulation  of  complex  clinical  problems  '  ,  are 

considerable.  As  more  models  are  developed,  the  limiting 
factor  will  be  physicians'  acceptance  of  computer-based 


statistical  models. 
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APPENDIX 


CHAPTER  I 


The  process  by  which  the  practitioner  of  medicine 
formulates  clinical  decisions  is  complicated.  In  its 
simplest  form,  it  can  be  seen  as  having  three  components. 
First,  the  clinician  interacts  with  the  patient  to  make 
observations  and  accumulate  data.  This  is  in  the  form 
of  obtaining  a  history,  performing  a  phvsical  examination 
and  ordering  the  appropriate  laboratory  procedures. 
Second,  the  clinician  takes  the  information  he  has 
gathered  and  translates  it  into  a  diagnosis,  an  assess¬ 
ment  of  the  disease  process  with  which  the  patient 
presents.  Finally,  the  clinician  formulates  a  plan  of 
therapy  which  reflects  both  the  illness  and  the  opportun¬ 
ities  for  intervention  through  medical  care,  social  or 
possibly  economic  change. 

To  aid  the  clinician  in  learning  the  first  of  these 
tasks,  there  are  many  resources.  Early  in  medical  train¬ 
ing,  he  is  exposed  to  a  course  in  physical  diagnosis. 
Numerous  textbooks  offer  the  trainee  techniques  of  exam¬ 
ination.  During  clinical  training,  through  constant 
exposure  and  experimentation,  the  student  is  able  to 
secure  an  adequate  knowledge  of  physical  diagnosis. 

To  formulate  a  plan  of  therapy,  the  clinician  is 
again  offered  a  large  number  of  resources.  Medical 


literature  abounds  with  textbooks  and  journals,  and 
faculty  and  housestaff  are  available  to  help  with 
clinical  problems.  Having  identified  the  disease  under 
consideration,  the  physician  need  only  turn  to  these 
sources  to  obtain  information  about  a  clinical  problem. 

It  is  the  middle  component  -  the  translation  of 
data  into  diagnosis  -  that  challenges  the  trainee. 

Often,  the  diagnosis  is  readily  apparent  from  the 
history,  physical  findings  or  laboratory  data.  A 
student,  examining  a  patient  with  the  recent  onset  of 
fever,  sputum-productive  cough  and  an  infiltrate  on 
chest  roentgenogram,  need  not  have  great  insight  to 
suspect  the  presence  of  a  pneumonia.  Frequently,  how¬ 
ever,  the  diagnosis  is  more  complicated,  either  from 
the  difficulty  of  obtaining  sufficient  historical, 
physical  examination  or  laboratory  data,  variation  of 
the  patient  from  the  classical  case  or  the  spectrum  of 
presentations  of  the  same  disease. 

Senior  clinicians,  when  asked  to  explain  their 
diagnostic  reasoning,  may  employ  the  term  "clinical 
judgment".  This  implies,  "I  recognize  something  in 
this  patient  which  clues  me  towards  a  specific  diagnosis 
In  identifying  their  decision  process,  some  invoke  the 
presence  of  an  obscure  finding,  others  may  describe  a 
patient  with  a  similar  presentation  they  saw  at  some 
other  time  in  their  career.  Astute  clinicians  are  often 
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able  to  outline  their  decision  processes,  making  explicit 
the  manner  in  which  they  accumulate  and  process  data. 

An  anecdote  offered  by  Mack  Lipkin,  a  psychiatrist  with 
an  interest  in  early  medical  training,  describes  an 
interesting  example: 

When  first  seen,  the  patient  aged  seventy,  was 
lying  on  his  back  in  a  dimly  lit  room  literally 
screaming  because  of  violent  abdominal  pain.  Only 
two  things  were  instantly  apparent  to  the  other 
physician  and  me  -  enormous  abdominal  distension 
and  extreme  generalized  abdominal  tenderness.  The 
history  and  further  examination  showed  that  the 
patient  had  peritonitis  of  unknown  origin.  He 
had  been  ill  for  four  days  but  had  refused  to  call 
a  physician.  He  did  not  remember  where  the  pain 
began  or  whether  he  had  had  a  bowel  movement.  When 
the  surgeon  arrived,  he  was  given  the  information: 
four  days  of  pain,  obvious  peritonitis,  and  a  temp¬ 
erature  of  105  degrees  Fahrenheit.  He  walked  into 
the  patient's  bedroom,  took  one  look,  and  in  about 
five  seconds,  said,  "Ruptured  retrocecal  appendix." 

Asked  why  he  was  so  sure  that  the  offending  organ 
was  the  appendix,  that  it  was  ruptured,  and  that  it 
was  retrocecal,  he  looked  surprised.  He  said,  "What 
else  can  it  be?  Look  where  he  has  burned  himself 
with  his  hot  water  bottle."  The  other  physician  and 
I  looked  at  the  abdomen  which  had  the  reticulated 
pattern  that  results  from  continued  use  of  the  heating 
pad,  stared  at  each  other,  and  then  turned  to  the 
surgeon.  "Why  do  heating  pad  burns  mean  a  ruptured 
retrocecal  appendix?"  He  answered,  "But  look  where 
the  marks  are  darkest."  We  looked  carefully  and  could 
see  no  difference.  He  pointed  to  the  extreme  right 
side  of  the  abdomen.  We  looked  again  and  agreed  that 
perhaps  these  marks  were  slightly  darker.  He  explained 
that  he  knew  of  no  other  condition  that  would  start 
with  pain  in  that  part  of  the  extreme  right  side  of 
the  abdomen  and  go  on  to  produce  peritonitis  except 
a  ruptured  retrocecal  appendix.  His  diagnosis  was 
absolutely  correct. 

Rarely  is  the  clinician  called  upon  to  make  such  difficult 
observations.  Nonetheless,  he  often  makes  diagnoses 
without  having  made  specific  observations  on  all  aspects 
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of  the  patient  or  making  explicit  statements  regarding 
his  observations. 

During  the  last  three  decades,  the  process  by  which 
clinicians  make  diagnoses  has  come  under  careful  scrutiny. 
Initially,  theoreticians  attempted  to  build  simple 
mechanical  models  to  aid  in  the  teaching  of  certain 
diagnostic  problems.  In  the  nineteen  fifties,  Boolean 
algebra  and  logic,  with  their  ability  to  translate 
language  into  mathematical  or  logical  statements,  offered 
researchers  the  opportunity  to  simulate  simple  clinical 
models.  With  the  development  of  high  speed  electronic 
computers  with  their  massive  storage  capacities,  some 
authors  have  simulated  more  complex  clinical  situations. 

The  purpose  of  this  dissertation  is  three-fold. 

First  the  author  will  review  the  history  of  these 
diagnostic  models.  Then,  a  clinical  model  which  has 
been  developed  and  tested  for  use  in  the  diagnosis  of 
congenital  heart  disease  in  the  newborn  period  will  be 
presented.  Finally,  there  will  be  a  discussion  of  the 
implications  of  these  diagnostic  models  in  present-day 
clinical  medicine,  as  well  as  some  of  the  opportunities 
for  future  research  and  clinical  application. 


FOOTNOTES 


1.  Mack  Lipkin,  The  Case  of  Patients  (New  York, 
University  Press,  1974)  pp.  155-156. 


Oxford 


CHAPTER  II 


THE  LITERATURE 

Confronted  with  each  new  medical  problem,  the 
physician  begins  by  recalling  recognized  patterns  of 
disease.  To  systematize  his  thinking,  although  not 
necessarily  consciously,  he  calls  into  action  an 
algorithm,  a  procedure  for  solving  a  problem. 

The  simplest  algorithm  -  one  symptom  or  some 
combination  of  symptoms  or  signs  stimulating  the  physi- 
cain  towards  a  particular  diagnosis  or  category  of 
diagnoses  -  can  be  represented  symbolically  as: 

Symptom (s)  - >  Disease (s) 

To  further  this  process,  the  clinician  may  call  upon  a 
textbook  of  differential  diagnosis,  a  resource  structured 
similarly  to  the  algorithm  shown  above.  Considering 
each  of  the  symptoms  separately,  the  physician  compiles 
lists  which  may  overlap  partially,  completely,  or  not 
at  all.  Extracting  the  common  elements,  he  is  often 
able  to  make  a  significantly  smaller  list,  and  if  addi¬ 
tional  data  is  made  available,  the  list  of  diagnostic 
possibilities  can  be  limited  even  further.  Through  this 
process,  the  clinician  has  extended  the  linear  model 
shown  above  to  a  more  complex  one,  a  model 
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which  involves  the  intersection  of  different  sets. 

A  Mechanical  Model 

If  the  number  of  symptoms  is  extensive,  the  text¬ 
book  of  differential  diagnosis  comprehensive,  or  more 
than  one  disease  present,  the  clinician  may  have  created 
an  enormous  task,  the  culling  and  sorting  of  hundreds 
of  different  diseases  before  the  appropriate  diagnosis 
can  be  entertained.  To  simplify  this  task,  Nash"*-  out- 
ined  in  1954  a  tool  which  he  felt  could  be  beneficial 
for  medical  diagnosis.  The  device,  which  viewed  on  side 
resembles  a  large  slide  rule,  consists  of  a  frame  into 
which  a  number  of  different  strips  can  be  inserted 
(Figure  1  )  .  Into  one  edge  of  the  frame,  disease  names 
are  inscribed  (three  hundred  in  this  case,  but  limited 
only  by  space).  The  strips,  each  of  which  represents 
data  obtained  from  history,  physical  examination  or  labor¬ 
atory  procedures,  are  slipped  into  the  frame,  one  beside 
the  other.  Markings  on  the  "symptom"  strips  then  line 
up  adjacent  to  the  diseases  in  which  they  are  present. 

By  selecting  strips  to  represent  the  patient's  symptoms 
and  observing  the  frame  for  the  location  of  the  greatest 
number  of  markers,  the  clinician  is  able  to  simplify  his 
differential  diagnosis. 

While  rather  cumbersome,  this  device  is  an  attempt 
to  simplify  the  use  of  textbooks  of  differential  diagnosis. 
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FIGURE  1 

A  SIMPLE  MECHANICAL  MODEL 
Nash,  F.  A.,  : Differential  Diagnosis"  The  Lancet 

(Volume  22,  No.  6816,  April  17,  1954)  p.  875. 
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Proposed  before  the  era  of  computational  devices,  it 
applied  the  technology  of  the  era. 

Early  Data  Punching 

Three  years  later,  as  punched  cards  became  available 

2 

to  the  scientific  community,  Lipkin  and  Hardy  utilized 

the  newer  technology.  They  proposed  to  "evaluate  the 

efficiency  with  which  mechanical  classification  and 

correlation  of  data  might  assist  in  the  utilization  of 

3 

data  in  the  differential  diagnosis  of  ...  disease." 

Using  marginal  punched  cards  (Figure  2  ) ,  a  type  of  data 
card  in  which  each  hole  punched  around  the  edge  of  the 
card  represents  a  specific  piece  of  information,  one 
space  corresponding  to  every  observation  to  be  considered 
they  selected  138  findings  from  history,  physical  examina 
tion,  peripheral  blood,  bone  marrow  and  other  laboratory 
examinations.  Twenty-seven  hematological  diseases  were 
classified  according  to  these  criteria  and  a  "master 
card"  prepared  for  each  disease  by  punching  a  wedge 
between  the  appropriate  hole  and  card  edge. 

Eighty  case  records  from  a  university  hospital  were 
then  abstracted  and  the  appropriate  information  punched 
on  marginal  cards.  The  cards  were  tabulated,  apparently 
by  comparing  each  patient  card  to  each  of  the  27  disease 
cards.  For  fifty  of  the  cases  an  exact  correlation  to 
one  disease  was  obtained,  and  in  each  it  was  identical 


FIGURE  2 


A  MARGINAL  PUNCHED  CARD 
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to  the  clinical  diagnosis  obtained  from  the  hospital 
chart.  In  twenty-three ,  the  data  from  the  case  corres¬ 
ponded  to  more  than  one  of  the  master  cards.  For  these, 
additional  data  was  obtained  from  the  charts  and  when 
retabulated  each  of  the  diagnoses  was  made  precisely. 

In  only  seven  cases  was  the  data  not  identical  to  any 
of  the  master  cards.  In  returning  to  these  medical 
records,  each  patient  was  found  to  have  more  than  one 
hematological  disease. 

4 

A  year  later,  Lipkm  and  Hardy  presented  a  more 

sophisticated  model  using  the  same  data  in  conjunction 

with  mechanical  techniques.  Now  using  26  diseases 

punched  as  before,  they  took  advantage  of  the  opportunities 

offered  by  marginal  punching,  a  process  termed  "direct 
5 

sorting"  .  To  find  all  the  diseases  characterized  by 
a  single  observation,  for  example,  splenomegaly,  "one 
would  place  this  set  of  cards  front  to  back  and  place  a 
metal  or  plastic  rod  into  the  hole  to  which  that  item  had 
been  assigned.  When  the  rod  was  raised,  cards  represent¬ 
ing  diseases  characterized  by  a  large  spleen  would  fall, 
because  the  triangular  wedge  would  have  been  punched  into 
that  space.  Cards  without  the  wedge  would  be  raised"^  . 
Discarding  the  eliminated  diagnoses  and  inserting  a  rod 
corresponding  to  a  second  symptom,  the  number  of  "eligible" 
diagnoses  could  be  further  limited.  When  the  symptom 
list  was  exhausted  several  possibilities  could  occur: 
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no  disease,  only  one,  or  more  than  one  remained  in 
the  differential.  As  before,  in  73  of  the  80  cases  a 
single  correct  diagnosis  was  obtained.  The  elimination 
of  all  possible  diagnoses  never  occurred  while  in  seven 
cases,  additional  mathematical  manipulation  was  necessary 
to  obtain  the  diagnosis. 

Electronic  Models 

With  the  development  of  high-speed  electronic  compu¬ 
ters,  attempts  were  made  to  utilize  the  newer  technologies. 
7 

Barness  proposed  a  computer-assisted  diagnostic  program 

which  presently  include  1,500  different  diagnoses.  The 

patient's  symptoms,  selected  from  a  comprehensive  list 

of  650  abnormal  findings  (75  historical,  315  physical, 

60  roentgenographic ,  and  200  laboratory  findings)  are 

entered  via  a  teletypewriter  to  a  remote  computer  facility. 

"The  computer  systematically  searches  its  data  bank, 

comparing  the  patient's  findings  with  each  diagnosis. 

The  computer  selects  the  diagnoses  that  should  be  included 

in  a  complete  differential  diagnosis.  These  are  listed 

8 

in  a  computer  report  by  the  teletypewriter  terminal" 

To  apply  the  system  to  clinical  situations  hospital 
records  were  reviewed  to  find  pediatric  patients  whose 
final  diagnosis  was  not  included  in  the  admitting  differ¬ 
ential.  Twenty  charts  were  abstracted  and  abnormal 
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findings  selected  from  the  initial  history.  When 

tabulated  by  the  program,  for  eighteen  of  the  twenty 

cases  the  differential  provided  by  the  computer 

included  the  discharge  diagnosis.  Although  the  program 

is  presently  extremely  expensive  -  each  run  costs  $50  - 

the  authors  contend  that  with  general  use  its  price 

could  become  "similar  with  that  of  a  customary  laboratory 
9 

test"  .  In  addition,  the  differential  diagnosis  is 
often  too  long  to  be  clinically  useful. 

Decision  Trees 

Alternatively,  one  can  employ  decision  trees  as 

10 

diagnostic  aids.  A  typical  one,  proposed  by  Krovetz 
(Figure  3 )  offers  the  user  a  number  of  decision  nodes 
from  which  to  choose.  After  each  selection  is  made, 
he  reaches  another  node,  until  finally,  a  minimal  number 
of  diagnoses  can  satisfy  the  acknowledged  criteria. 

Decision  trees,  while  very  useful  in  simple  and 
"classical"  cases,  tend  to  have  significant  limitations. 
Atypical  cases,  or  those  which  present  in  varied  ways, 
tend  to  become  lost  in  the  decision  process.  For  example, 
Krovetz's  cardiological  model  uses  "obvious  cyanosis" 
as  its  first  nodal  point.  Following  each  of  the  superior 
alternatives  -  no  cyanosis,  no  thrill,  decreased  pulmonary 
markings  -  one  reaches  a  point  at  which  no  diagnoses  are 
tenable.  The  authors  then  suggest  reexamination  for 


'•' * - ,  CwieO'. 
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FIGURE  3 

A  branching  seauence  for  the  diagnosis  of  congenital 

,.  heart  disease. 

From  r  KF8vW€z,  JeroMey  et  ■  al .  y  ••  Handbook  of  Pediatric. 

Cardiology  (New  York,  Harper  and  Row,  1969) ,  p. 


15 


cyanosis,  allowing  for  either  observer  error  or  varia¬ 
tions  in  hematocrit. 

An  attempt  to  circumvent  these  limitations  was 

11 

proposed  by  Kleinmuntz  .  His  decision  tree  involves 

neurological  diagnosis,  "an  example  par  excellence  of 

diagnostic  problem  solving  . . .  because  of  the  highly 

12 

structured  nature  of  the  clinical  data  within  it." 

The  scheme  evolves  from  a  game  of  "Twenty  Questions" 

in  which  the  experimenter  thinks  of  a  disease  while  the 

other  participants  are  invited  to  ask  questions  in 

pursuit  of  the  diagnosis.  The  information  provided  by 

the  neurologist,  characterizing  symptoms  SI , S2 , S3 , etc . 

(Figure  4 )  may  involve  history,  physical  exam  or  laboratory 

data.  Following  completion  of  the  game,  a  diagnostic 

tree  is  built,  which  is  intended  "to  be  an  information 

processing  theory  of  ...  diagnostic  behavior....  [The] 

program  is  a  theory  that  explains  the  diagnostic  problem 

solving  behavior  of  individual  clinical  neurologists  in 

„  13 

a  given  set  of  n  situations. 

Two  distinct  models  of  diagnosis  are  proposed  by 
Kleinmuntz.  The  computer  could  "have  stored  in  its 
memory  core  all  the  known  symptoms,  signs,  demographic 
variables,  laboratory  tests  and  treatments,  and  then 
when  a  particular  set  of  symptoms  or  signs  are  presented 
to  it,  the  computer  could  proceed  very  much  like  a 
beginning  medical  student  who  might  consult  a  diagnostic 


1st  Episode  ? 

Did  the  hemiparesis 
improve  ? 

Did  the  scotoma 
improve  ? 


Did  the  mao  have  any  vertigo 
with  his  attack  ? 

Did  he  lose  consciousness  ? 

Does  he  have  diminished 
carotid  pulsation  ? 


Does  he  have  a  bruit  over  his 
carotid  ? 

Does  he  have  a  carotid 
stenosis  on  the  left  ? 

OK,  the  diagnosis  is 
stenosis  of  the  left 
carotid  artery 

Flr"  L  A  tree  structure  of  a  neurologist's  dia- 
unostic  gamc  in  which  the  information  given  was: 
■Sudden  left  central  scotoma  and  right  hemiparesis 
in  a  55  year  old. 


FIGURE  4 

A  neurologist's  branching  sequence. 

From:  Kleinmuntz,  Benjamin,  "Diagnostic  Problem  Solving 

by  Computer"  Japanese  Psychological  Research 
(Volume  7,  No.  4,  1965)  p.  192. 
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manual  or  textbook."  "  In  addition  to  being  very  tedious, 
this  program  would  teach  very  little  about  the  "diag¬ 
nostic  process".  Alternatively,  the  program  could 
incorporate  all  the  above  information  "and  take  into 
consideration  the  relative  frequency  of  a  set  of  symptoms 
for  a  particular  disease  and  the  frequency  of  occur¬ 
rence  of  a.  particular  disease  in  a  particular  geographic 
„15 

region . 

One  model  employing  these  techniques  appears  in  a 
recent  paper  by  Brand,  Dove  and  Meyers.^  To  character¬ 
ize  the  difference  between  non-bacterial  and  bacterial 
meningitis,  they  examined  the  records  of  several  hundred 
children  seen  at  the  Yale  New  Haven  Hospital  during 
a  fifteen  year  period.  Several  innovations  appear  in 
their  study  including  derivation  of  "cutoff  points" 
from  data  analysis  rather  than  arbitrary  criteria,  and 
use  of  statistical  techniques  rather  than  "clinical 
judgment"  to  determine  which  criteria  discriminate 
"best".  Above  all,  the  branching  sequence  does  not 
offer  only  one  diagnosis  at  the  completion  of  all  the 
nodes  but  considers  at  all  points  the  likelihood  of 
bacterial  versus  non-bacterial  meningitis  (Figure  5  ) . 

Only  if  the  likelihood  were  zero  or  100%  would  this 
be  equivalent  to  the  traditional  branching  sequence. 


COMPLETE  PROBABILITY  TREE ; 

DISTRIBUTION  OF  CASES  BASED  ON  COMBINED  PATIENT  AGE,  CSF  GRAM  STAIN,  CSF  GLUCOSE  LEVEL 

AND  CSF  WHITE  CELL  COUNT 


FIGURE  5 

A  branching  sequence  for  the  differentiation  of 
non-bacterial  and  bacterial  meningitis.  \ 

From:  Brand,  Don,  et  al . ,  "A  Technique  ror  Analyzing 

Clinical  Data~~to  Provide  Patient  Management 
Guidelines:  A  Study  of  Meningitis  in  Children 
Submitted  for  publication. 
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Statistical  Models 

17 

In  1964,  Lipkin  proposed  a  statistical  model 
to  supplement  his  earlier  work.  Recognizing  that  the 
marginal  card  technique  was  unable  to  differentiate 
between  important  and  less  important  data,  he  sought 
to  weigh  the  relative  value  of  symptoms. 

Using  twenty-six  hematological  diseases,  he  listed 

the  138  symptoms.  "A  numerical  value  or  weight  was 

then  assigned  to  each  item  of  information  present  in 

each  of  the  diseases.  The  instructions  stated:  given 

a  disease  description,  assign  a  positive  weight  to  data 

that  contribute  to  the  diagnosis,  give  a  negative  weight 

to  data  that  do  not.  Let  the  significance  of  the  item 

in  the  disease  determine  the  weight....  The  sum  of 

all  the  weights  in  each  disease  will  yield  the  total 

positive  weight  and  the  total  negative  weight  of  the 

disease....  By  comparing  the  sum  of  the  weighted 

diagnostic  criteria  of  the  hospital  case  to  the  sum  of 

the  weighted  diagnostic  criteria  of  the  disease,  the 

degree  of  identical  fit  of  hospital  case  data  in  each 

1 8 

case  was  defined." 

Figure  6  shows  the  profile  of  one  disease,  followed 
by  the  same  profile  superimposed  on  a  patient  profile, 
yielding  a  Diagnostic  Score.  The  Diagnostic  Index  is: 


D.  I . 


Z  X  •  W  /  E  W 


Disease  i 


Symptoms 

Fig.  2.— Schematic  profile  of  one  disease.  The  abscissa  records  symptoms  1-8;  the  ordinate  records 
the  weight  of  each  symptom.  The  dotted  line  connects  the  weights  and  emphasizes  groups  of  symp¬ 
toms  having  high  and  low  weights.  The  arrow  defines  a  pathognomonic  symptom. 


Diagnostic  Score 

ol  value  tound 
(patient's  value! 
and  value  expected 
(symptom  weight! 


0.8 

0.2 

0.8 

0J 

2.1 


Symptoms 

Patient's  symptoms 
•  present 
-  -  afisent 


Total  disease 
weight  ’4.8 


Fig.  3. — The  same  disease  profile  ( solid  lines)  and  the  profile  of  a  patient  ( dashed  lines)  superim¬ 
posed.  The  value  found  (patient’s  value)  is  multiplied  by  the  value  expected  (symptom  weight)  to 
produce  a  Diagnostic  Score.  Further  division  by  the  total  disease  weight  produces  the  Diagnostic  In¬ 
dex.  The  latter  measures  the  degree  of  identical  ft  of  the  patient’s  data  in  each  disease. 


FIGURE  6 


From: 


Lipkin,  Martin,  "The 
Differential  Diagnos 
and  Biology  (Summer, 


Likelihood  Concept  of 
j_s "  Perspectives  in  Medicine 
1964) ,  p.  488-489  .  ” 
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where  X  (1  or  0)  corresponds  to  the  symptom  being 
present  or  absent.  W  is  the  weight  assigned  to  that 
item  in  that  disease. 

"If  a  hospital  case  contained  all  the  findings 

of  a  given  disease,  the  Diagnostic  Index  was  1.00. 

If  a  hospital  case  contained  none  of  the  findings 

that  characterized  a  disease,  the  Diagnostic  Index 

was  zero.  If  it  contained  one-half  the  findings  of  the 

disease,  the  Diagnostic  Index  was  0.5."^ 

Figure  7  demonstrates  several  representative 

cases,  comparing  hospital  diagnoses  to  diagnostic 

indices.  Lipkin  notes  that  "when  the  Diagnostic  Index 

approached  0.2  the  disease  in  question  became  important 

in  the  differential  diagnosis  and  had  many  features 

identical  to  those  in  the  hospital  case.  When  the 

Diagnostic  Index  reached  0.4,  the  disease  was  very 
20 

important" . 

A  second  statistical  model,  for  the  diagnosis  of 

thyrotoxicosis,  was  authored  by  Crooks,  Murray  and  Wayne 

in  1958.  Their  method  "consisted  in  allocating  a 

positive  or  negative  score  to  each  clinical  feature, 

the  values  being  based  on  an  analysis  of  the  relative 

frequency  of  symptoms  and  signs  in  the  disease.  In 

this  way  a  total  score,  or  clinical  diagnostic  index, 

22 

can  be  obtained  in  each  case."  Signs  and  symptoms 


COMPARISON  OF  WEIGHTED  AV E  RAGES  AND  HOSPITAL  DIAGNOSES 

FOR  SEVEN  CASKS 


C  use 

Diagnostic  Index 
( weighted  averages,  positive) 

Hospital  diagnoses 

1 

Chronic  lymphatic  leukemia  0.41 

Acute  hemolytic  anemia  0.2S 

Idiopathic  thrombocytopenic  purpura  0.27 
Hemophilia  0.20 

Chronic  lymphatic  leukemia 

Acquired  hemolytic  anemia 

2 

Agranulocytosis  0.40 

Chronic  lymphatic  leukemia  0.45 

Sprue  0.21 

Hereditary  spherocytosis  0.  19 

Chronic  lymphatic  leukemia 
Agranulocytosis 

3 

Mediterranean  anemia  0.28' 

Hemophilia  0.23 

Acute  hemolytic  anemia  0.22 

Multiple  myeloma  0.22 

Mediterranean  anemia 
Nonthrombocytopenic  purpura 

i 

4 

Hypochromic  microcytic  anemia  0.31 

Idiopathic  thrombocytopenic  purpura  0.27 
Multiple  myeloma  0.19 

Mediterranean  anemia  0.  15 

Idiopathic  thrombocytopenic  purpura 
Anemia,  acute  and  chronic 

5 

Multiple  myeloma  0.37 

Aplastic  anemia  0.34 

Agranulocytosis  0.30 

Chronic  lymphatic  leukemia  0.09 

Multiple  myeloma 

Agranulocytosis 

6 

Agranulocytosis  0.47 

Chronic  lymphatic  leukemia  0.21 

Eosinophilic  leukemia  0.  19 

Monocytic  leukemia  0.18 

Agranulocytosis 

Macrocytic  anemia 

7 

Agranulocytosis  0.47 

Aplastic  anemia  0.21 

Chronic  lymphocytic  leukemia  0. 19 

Acute  posthemorrhagic  anemia  0.19 

Macrocytic  anemia 

Neutropenia 

Thrombocytopenia 

FIGURE  7 

Diagnostic  indices  for  sample  hematological  cases 

From:  Lipkin,  Martin,  "The  Likelihood  Concept  of 

Differential  Diagnosis"  Perspectives  in  Medicine 
and  Biology  (Summer,  1964),  p.  487. 
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v;ere  chosen  "because  they  had  previously  been  shown 

by  a  clinical  survey  to  differ  in  their  incidence  in 

23 

thyrotoxic  and  normal  subjects.  Weights  were  assigned 

to  the  signs  such  that  "the  positive  or  negative  values 

of  these  scores  were  . . .  allocated  on  the  basis  of  the 

24 

relative  diagnostic  significance  of  each"“  as  found  in 
previous  studies  (Figure  8  ) .  After  some  modifications 

necessary  to  accomodate  for  observer  variation,  the 
clinical  diagnostic  indices  were  calculated.  In  a  group 
of  patients  who  were  clinically  "non-toxic",  the  scores 
ranged  from  -16  to  +10  while  definitely  "toxic" 
patients  ranged  from  +21  to  +42.  One  hundred  eighteen 
patients,  for  whom  an  initial  clinical  diagnosis  was 
unclear  despite  radioactive  idodine  studies,  were  placed 
in  a  "doubtful"  group.  Of  the  sixty-seven  felt  to  be 
non- toxic  by  final  clinical  diagnosis,  fifty-nine  had 
diagnostic  indices  in  the  non-toxic  range  (less  than  11), 
seven  in  the  intermediate  range  and  one  in  the  toxic 
range  (twenty  or  greater) .  Of  these  eventually  felt 
to  be  toxic,  none  fell  into  the  non- toxic  range,  six 
in  the  intermediate  area  and  forty-five  had  diagnostic 
indices  which  correlated  with  their  clinical  status. 
Exluding  the  single  non-toxic  patient  with  a  toxic 
index,  these  data  would  suggest  that  a  diagnostic 


Table  I 


Weighting  Factors  Allocated  to  the  Symptoms  and  Signs  of  Thyrotoxicosis 

Signs 


Symptoms  of  recent  onset  or  increased 
severity 


Present 

Absent 

(score) 

(score) 

Dyspnoea  on  effort 

+ 1 

Palpitations  . 

+  2 

Tiredness 

+  2 

Preference  for  heat  (irre¬ 

spective  of  duration)  . 

-5 

Preference  for  cold 

+  5 

Indifferent  to  tempera¬ 

ture  .... 

0 

Excessive  sweating 

+  3 

Nervousness . 

+  2 

Appetite  increased 

+  3 

Appetite  decreased 

-3 

Weight  increased  . 

-3 

Weight  decreased  . 

+  3 

Present 

Absent 

(score) 

(score) 

Palpable  thyroid 

+  3 

-3 

Bruit  over  thyroid 

+  2 

-2 

Exophthalmos 

+  2 

Lid  retraction 

+  2 

Lid  lag 

+  1 

Hyperkinetic  movements 

+  4 

-2 

Fine  finger  tremor. 

+  1 

Hands : 

Hot  .... 

+  2 

-2 

Moist 

+  1 

-1 

Casual  pulse  rate: 

Auricular  fibrillation  . 

+  4 

Regular  rate: 

Under  80 

-3 

80-90  . 

0 

Over  90. 

+  3 

FIGURE  8 


From: 


Crooks,  J.,  et  al.,  "Statistical  Methods  Applied 
to  the  Clinical  Diagnosis  of  Thyrotoxicosis 
Quarterly  Journal  of  Medicine  (Volume  28,  No.  11 
April,  1959)  p.  212. 
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index  of  ten  or  less  represented  a  non-toxic  patient 

while  at  greater  than  twenty,  hyperthyroidism  was 

present.  In  the  intermediate  range,  a  relatively  small 

group,  no  conclusions  could  be  drawn. 

A  third  statistical  model  was  proposed  by  Collen, 

25 

et  al.  ,  for  use  in  the  Kaiser  Foundation  Health  Plan. 

As  part  of  a  multiphasic  screening  program,  an  attempt 

was  made  to  identify  "whether  there  is  sufficient 

likelihood  of  disease  being  present  to  warrant  further 

2  6 

specific  diagnostic  testing".  One  example,  the 

differentiation  of  asthmatic  patients  from  those  without 

asthma,  involved  a  "likelihood"  principle  proposed  by 
27 


Neyman 


The  likelihood  ratio  "is  the  ratio  of 


the  probability  (P  )  with  which  a  selected  set  of 
symptoms  (S)  occurs  in  a  specific  disease  (D)  to  the 

C 

probability  ( P^ )  with  which  the  same  set  of  symptoms  (S) 

S  S 

occurs  in  the  non-diseased  state  (N) ;  that  is  0  =  Pp/P^. 
Each  symptom  set,  therefore,  is  associated  with  a  0  value. 
The  0s  are  then  arranged  in  order  of  increasing  magni¬ 
tude."  28 


"Six  dichotomous  questions  were  selected  from  a 
questionnaire  form  given  to  230  patients  with  a  clinical 
diagnosis  of  bronchial  asthma  and  to  a  group  of  517 
randomly  selected  patients  who  . . .  were  known  to  be  free 
of  asthma."  The  results,  shown  in  Figure  9  demonstrate 
an  area  in  which  the  likelihood  is  zero  and  another  with 
a  likelihood  of  infinity.  Interestingly,  seven  percent 
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of  the  asthmatics  and  68  percent  of  the  non-asthmatics 

answered  no  to  all  the  questions,  providing  a  likelihood 

of  .102  for  that  set  of  responses. 

These  results  were  used  to  select  a  "positive 

region  which  locates  the  combination  of  questions  which 

screen  out  as  positive  the  maximum  number  of  patients 

30 

with  asthma  with  a  low  error  of  false  negative." 

Recognizing  that  "for  bronchial  asthma,  an  error  in 

diagnosis  may  not  be  of  immediate  serious  consequences, 

a  relatively  high  ...  [likelihood  ration  cutoff]  for  this 

31 

test  may  be  selected." 


Boolean  Algebra 

With  the  development  of  statistical  diagnostic  models, 

it  became  apparent  that  diagnosis  lent  itself  to  the 

32 

use  of  algorithms.  It  was  Ledley  and  Lusted  in  their 

article  "Reasoning  Foundations  of  Medical  Diagnosis"  who 

introduced  Boolean  Logic  to  medical  diagnosis.  They 

suggested  that  "the  first  step  in  making  a  logical 

analysis  of  this  process  is  to  review  some  symbolism 

associated  with  the  propositional  calculus  of  symbolic 

logic.  Such  symbolism  enables  the  more  precise  communica- 

33 

tion  of  the  concepts  involved  in  logical  processes." 
Symbols,  x,y,z...,  were  introduced  to  represent  attributes 
such  as  a  sign  (fever)  or  disease  (pneumonia) .  Correspond¬ 
ing  capital  letters  were  used  in  statements  about  these 
attributes.  "For  example,  y  represents  the  sentence: 
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The  patient  has  the  attribute  y.  The  negation  of  this 

statement,  the  patient  does  not  have  the  attribute  y, 

is  represented  by  Y,  where  the  bar  (called  negation) 

34 

over  the  Y  indicates  'not'." 

Applying  Boolean  logic,  the  statement  X*Y,  the 
intersection  of  the  separate  statements  X  and  Y,  implies 
that  the  patient  has  both  the  attribute  x  and  the 
attribute  y.  The  statement  X  +  Y,  the  union  of  the 
statements  X  and  Y,  indicates  the  group  which  has 
attribute  x,  attribute  y  or  both.  A  third  statement, 

X  Y,  represents  the  logical  statement,  "If  the  patient 
has  attribute  x,  he  then  has  attribute  y." 

35 

The  authors  generated  three  functions: 

1.  E  (S  (1)  ,  .  .  .  ,S  (n)  ,D  (1)  ,  .  .  .  ,D  (m)  )  which 

represents  the  relationships  between 
diseases  the  symptoms  that  comprise 
medical  knowledge, 

2.  G  (S ( 1)  , . . . , S  (n) )  which  represents  the  symptoms 

presented  by  a  patient,  and 

3.  f  (D  (1)  , . . . ,D (m) )  which  represents  the  diagnosis 

as  Boolean  function  of  the  diseases 
alone . 

"The  logical  aspect  of  the  medical  diagnosis  problem 
is  to  determine  the  diseases  f  such  that  if  medical 
knowledge  E  is  known,  then:  if  the  patient  presents 
symptoms  G,  he  has  diseases  f.  In  terms  of  our  symbolic 
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language,  the  problem  is  to  determine  a  Boolean  function 
f  that  satisfies  the  following  formula: 

E  ->  (  G  -*  f  ) 

3  6 

This  is  the  fundamental  formula  of  medical  diagnosis." 

The  difficulty  of  application  of  these  techniques 
to  medical  diagnosis  lies  not  in  the  theory  but  in  the 
lack  of  direct  association  between  cause  and  effect, 
or  in  practical  terms,  symptom  and  diagnosis.  For 
example,  in  a  certain  disease,  there  may  be  a  seventy- 
five  percent  chance  of  a  symptom  being  present. 


Since  "chance"  or  "probabilities"  enter  into 

medical  knowledge",  then  chance  or  probabilities 

enter  into  the  diagnosis  itself.  At  present,  it 

may  generally  be  said  that  specific  probabilities 

are  rarely  known;  medical  diagnostic  textbooks 

rarely  give  numerical  values,  although  they  may 

use  words  such  as  "frequently",  "very  often"  and 
9  V 

"almost  always". 


The  first  step  in  discussing  a  probabilistic 
analysis  of  medical  diagnosis  is  to  review  some 
definitions  and  important  properties  of  probabili¬ 
ties.  The  concept  of  total  probability  is  concerned 
with  the  following  question.  Suppose  we  select 
at  random  from  our  population  of  patients  one  single 
patient:  what  is  the  chance  or  total  probability, 

that  the  patient  chosen  has  certain  specified 
attributes  f (x,y, . . . ,z) ?  By  definition,  the  total 
probability  is  the  ratio  of  the  number  of  patients 
that  have  these  attributes  to  the  total  number  of 
patients  from  which  the  random  selection  is  made. 

If  the  total  number  of  patients  is  N,  and  if  N(f)  is 
the  number  of  these  patients  with  attributes  f,  then 
the  total  probability  that  a  patient  has  attributes  f 
is : 


P(f) 


N(f)/N 
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The  conditional  probability  is  analagous  to 
the  total  probability,  where  the  selection  is 
made  only  from  that  subpopulation  of  patients 

O  CD 

that  have  the  specified  condition. 


The  problem  now  is:  Which  of  these  choices  is 
most  probable  -  that  is,  which  of  the  disease 
complexes  given  by  the  logical  diagnosis  function 
f  is  the  patient  most  likely  to  have.  In  terms 
of  conditional  probabilities,  the  probabilistic 
aspect  of  the  diagnosis  problem  is  to  determine 
the  probability  that  a  patient  has  disease  f  where 
it  is  known  that  the  particular  patient  presents 
symptoms  G,  that  is,  the  probabilistic  aspect  of 
medical  diagnosis  is  to  evaluate  P(f/G)  for  a 
particular  patient. ^ 9 


The  data  upon  which  the  evaluation  of  P(f/G)  is 
based  must,  of  course,  come  from  medical  know¬ 
ledge.  Such  medical  knowledge  is  generally  also 
given  in  the  form  of  conditional  probabilities  - 
namely,  the  probability  that  a  patient  having  (a) 
disease  complex  will  have  the  symptom  complex.... 

It  is  interesting  to  note  that  most  diagnostic 
textbooks  discuss  the  symptoms  associated  with 
the  disease  rather  than  the  reverse,  the  disease 
associated  with  a  symptom.  The  question  that 
naturally  arises  at  this  point  is:  If  medical 
knowledge  is  in  the  form  P(G/f)  that  is,  probability 
of  having  the  symptoms  given  the  patient  having  the 
disease,  then  how  can  we  make  the  diagnosis  P(f/G)  - 
that  is,  the  probability  of  having  the  disease  given 
the  patient  having  the  symptoms  A  0 

It  is  here  that  Bayes  Law,  a  well-known  formula 
for  relating  cause  and  effects  in  probabilistic  terms, 
becomes  important.  A  derivation  of  this  law  appears  in 
the  following  chapter.  Let  it  suffice  at  present  to 
state  that  Bayes  Law  permits  the  calculation  of  the 
conditional  probability  of  a  disease  in  a  given  patient 
utilizing  only  the  likelihood  of  the  symptom  complex 
within  each  disease. 
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Bayesian  Models 

Numerous  authors  have  developed  Bayesian  models 
to  assist  in  medical  diagnostic  problems.  Amongst 
the  earliest  was  one  proposed  by  Warner,  et  al.41,  in 
1961.  Choosing  congenital  heart  disease,  because  "the 
accuracy  of  diagnosis  . . .  from  clinical  symptoms  may 
be  checked  by  cardiac  catheterization  and/or  findings 
at  surgery,  and  because  relatively  objective  clinical 

a  2 

findings  may  be  easily  obtained,"  "  they  studied  a 
population  ranging  in  age  from  one  month  to  over  twenty 
years.  Fifty  symptoms  felt  to  be  of  value  in  the 
diagnosis  of  these  disorders,  and  thirty-three  diseases 
were  chosen  (Figure  10  ) .  The  symptom-disease  matrix 

(Figure  11  )  was  completed  through  three  sources: 

1.  a  review  of  previously  published  data 

2.  chart  reviews  of  1035  patients  previously 

seen  by  this  group 

3.  "estimates  based  upon  the  pathologic 

physiology  of  the  defect  in  the 

case  of  rare  defects  in  which 

adequate  statistics  were  not  avail- 
43 

able" 

The  authors  observed  that  "it  is  apparent  from  our 
experience  . . .  that  the  most  probable  diagnosis  . . . 
agrees  with  the  actual  diagnosis  made  by  physiologic 


Table  1.— List  of  Symptoms  to  Be 
Evaluated  by  Physician 


Symptoms 


r  xi  =upe  1  mo.  to  1  yr. 
t  <  =u;;e  1  to  "JO  y  r. 

Lxa  =>20  yrs. 

.  xt  =cyanosis,  mild 
,  j  xs  =eyauosis,  severe  (with  clubbing) 

T  |  x.i  =cyanosis,  intermittent 
L \ ~  =cyuuoM»,  differential 
x»  =squatting 
.\u  =dyspuea 
xiu— easy  tatigue 
Xu=orthopnea 
xis=chest  pain 

xi3=repeated  respiratory  Infections 
xn=syncope 

Xis=systolic  murmur  loudest  at  apex 
Xi.i=diastolie  murmur  loudest  at  apex 
|"xi-=systolic  murmur  loudest  in  lett.  4th  interspace 
t  <  xi»=diastolie  murmur  loudest  in  lett  4th  interspace 
l_xi«=euntinuous  murmur  loudest  in  left  4th  interspace 
I  Xao=systolie  murmur  with  thrill  loudest  In  left  2nd  inter- 


1  space 

Xai=svstolic  murmur  without  thrill  loudest  In  left  2nd 
interspace 

X2-j=diustolic  murmur  loudest  in  left  2nd  Interspace 
X:s=continuous  murmur  loudest  in  left  2nd  interspace 
X2i=systolic  murmur  loudest  in  right  2nd  interspace 
Xs5=;diastolic  murmur  loudest  in  right  2nd  interspace 
,  I  X3«=systoIic  murmur  heard  best  over  posterior  chest 
’  1  Xjr— continuous  murmur  heard  best  over  posterior  chest 
.  J  Xss=accentuated  2nd  heart  sound  in  left  2nd  interspace 
T  1  X28=diminished  2nd  heart  sound  in  left  2nd  interspace 
.\3o=ripht  ventricular  hyperactivity  by  palpation 
Xsi=forceftd  apical  thrust 
xa2=pulsittile  liver 

X33=absent  or  diminished  femoral  pulsation 
.  I  X3i=KCt;  axis  more  than  111)3 
T  \  x as— K CO  axis  less  than  0° 

,  i  X3«=I{  wave  greater  than  1.2  inv  in  lead  Vt 
\  X:it=K'  or  i|K  pattern  in  lead  Vi 
X3s=R  wave  greater  than  2.0  mv  in  lead  Vn 
X3»=T  wave  in  lead  Vs  inverted  tno  digitalis) 
j  I  xn»=early  diastolic  murmur  loudest  at  apex 
\  xu=late  diastolic  murmur  loude-t  at  apex 
-  '  xt2=holo-s.vstolic  murmur  loudest  in  left  4th  interspace 
'  \  Xi3=miil-systolic  murmur  loudest  in  left  4th  interspace 
(  j  XM=holo-diastolic  murmur  loudest  in  lett  4th  interspace 
1  Xi3=early  diastolic  murmur  loudest  in  left  4th  interspace 
f x««=mid-systolic  tmmiiur  with  thrill  loudest  in  2nil  left 
interspace 

I  Xi7=holo .systolic  murmur  with  thrill  loudest  in  2nd  left 
t  j  interspace 

I  xis=mid-systolic  murmur  without  thrill  loudest  in  2nd 
left  interspace 

x»»=holn-systolie  murmur  without  thrill  loudest  In  2nd 
„  left  interspace 

xii.=murmur  louder  than  yr  :f  d _ _____ 


Table  2.— List  of  Diseases  Included 
in  Differential  Diagnosis 

Diseases 

yj  =noruud 

y_-  sratiial  septal  defect  without  pulmonary  stenosis  or  pulmonary 
hypertension 

y 3  —a 1 1  ial  sPptal  defect  with  pulmonary  stenosis 
yi  =a trial  septal  defect  with  pulmonary  hvp,.rtension " 
y.-.  ^complete  endocardial  cushion  detect  (A  Y  commune' 
y.i  =parttal  anomalous  pulmonary  venous  connections  (without  atrial 
septal  delect  i 

yr  =  toral  anomalous  pulmonary  venous  connections  t-upiudiuphragm- 
aticl 

ys  =  tricuspid  utiC'ia  without  transposition 
y»  —Ebstein's  anomaly  at  tricuspid  valve 

yii.=vcntricul;ir  septal  detect  with  valvular  pulmonary  stenosis 
y n= ventricular  septal  defect  with  infundibular  -teno-is 
yi2=pulmonar.\  stenosis,  valvular  (with  or  without  probe-patent  fora¬ 
men  ovale) 

yi33=pulmonarv  stenosis,  infundibular  (with  or  without  probc-puient 
foramen  ovale) 
y i i=pulmoua ry  atresia 

y i.3=pitlmonury  artery  stenosis  ^peripheral) 
y m=pulmouary  hypertension,  isolated 
yir=aortie  pulmonary  w  indow 

yi»=patent  ductus  arteriosus  without  pulmonary  liypcrteuslou' 

yi»=pulmonary  arteriovenous  fistula 

y..H.=  mitral  stenosis 

ysi=primury  myocardial  disease 

yaa=anoinaloifs  orinin  of  left  coronary  ariery 

y-j3=aorric  valvular  stenosis 

ysi=$ubaortic  stenosis 

y23=coarctnrion  of  aorta 

y:«=truncus  arteriosus 

y2T=transposcd  great  vessels 

y-js— corrected  transposition 

ya»=absent  aortic  arch 

y»u=ventricular  septal  defect  without  pulmonary  hypertension* 
yai  =  ventriculur  septal  defect  with  pulmonary  hyperieusion* 
y3j=patcnt  ductus  arteriosus  with  pulmonary  hypertension" 
y:i:i=tricuspid  atresia  with  transposition _ _ _____ 


FIGURE  10 

Symptoms  and  diseases  for  a  Bayesian  congenital  heart  disease- 

diagnostic  model. 

From:  Warner,  Homer,  et  al . ,  "A  Mathematical  Approach  to 

Medical  Diagnosis"  Journal  of  the  American  Medical 
Association  (Volume  177,  No.  3,  July  22,  1961)  p.  179. 
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FIGURE  11 

Symptom-disease  matrix  for  a  Bayesian  congenital  heart  disease 

diagnostic  model. 

From:  Warner,  Homer,  et  al . ,  "A  Mathematical  Approach  to 

Medical  Diagnosis"  Journal  of  the  American  Medical 
Association  (Volume  177,  No.  3,  July  22,  1961)  p.  180-181. 
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studies  and  observation  at  surgery  at  least  as  often  as 
does  the  most  probable  diagnosis  estimated  by  three 
experienced  cardiologists  from  the  same  clinical  infor¬ 
mation.  Furthermore,  the  differential  diagnosis 
resulting  from  solution  of  the  equation  is  frequently 
more  complete,  and,  in  retrospect,  often  appears  more 
logical  to  clinicians  than  the  differential  diagnosis 
listed  by  each  of  them  prior  to  seeing  the  equation's 
prediction."  44 

"For  most  radiologists,  having  a  patient  with 

a  primary  bone  tumor  can  be  considered  a  rare  event, 

and  it  is  difficult  for  a  radiologist  to  accumulate  in 

a  lifetime  a  significant  experience  in  the  radiological 

45  46,47 

diagnosis  of  bone  tumors."  The  work  of  Lodwick  suggests 
that  a  Bayesian  model  can  be  of  great  assistance  to  a 
radiologist  confronted  with  films  suggestive  of  a  bone 
tumor.  To  be  of  value  to  the  clinician,  two  criteria 
must  be  met.  The  system  must  be  organized  so  that  the 
clinician  is  not  overwhelmed  by  the  complexity  of  computer 
language.  Second,  definitions  must  be  provided  for  any 
symptoms  or  terms  on  which  a  universal  opinion  does  not 
exist  or  subjective  influence  can  occur. 

Lodwick  has  presented  his  Bayesian  model  in  the  form 
of  a  brochure  containing  several  forms  (Figures  12  and  13  ) 
which  explain  the  system.  Data  is  entered  onto  punch 
cards  and  a  Bayesian  analysis  performed  using  Lodwick' s 
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Guidelines  for  a  Bayesian  bone  tumor  diagnostic  model. 

From:  Lusted,  Lee,  Introduction  to  Medical  Decision  Making 

(Springfield,  Illinois,  Charles  Thomas,  1968). 


Table  1.  Form  used  in  computer-aided  grading  and  diagnosis  of  primary  bum;  iumum 
(Delay  in  diagnosis  and  5-year  survival  are  not  used  by  the  computer) 


Case  Number  (1-6)  . 

1.  Clinical  data  Col.  Yes  No 


Male 

7 

1 

0 

0-9 

8 

1 

-> 

10-19 

9 

1 

2 

20-29 

10 

1 

2 

30-39 

11 

1 

2 

>  40 

12 

1 

-> 

Months  delay 
in  diagnosis 

0  13  1  2 

1-2  14  1  2 

3-11  15  1  2 

>12  16  1  2 

Five  years  survival 
from  diagnosis 

17  1  0 


2.  Size  mm 


00 

18 

1 

01-30 

19 

1 

7 

31-60 

20 

1 

2 

61-90 

21 

1 

91  up 

1 

Tumor  matrix 

mineralization 

Col.  Yes 

No 

Radiolucent 

35 

1 

0 

Cloudy 

36 

1 

0 

Flocculent 

37 

1 

0 

Lumpy 

38 

1 

0 

Solid 

39 

1 

0 

Destruction  of  bone 

Geographic 

40 

1 

0 

Moth-eaten 

41 

1 

0 

Permeated 

42 

1 

0 

Fracture 

43 

1 

0 

Displacement 

44 

1 

0 

Regular  margin 

45 

1 

0 

Lobulated  margin 

46 

1 

0 

Ragged  margin 

47 

1 

0 

Indistinct  margin 

48 

1 

0 

Sharp  edge 

49 

1 

0 

Smudged  edge 

50 

1 

0 

Invasive  edge  1  cm 

51 

1 

0 

Inv.  edge  gr.  1  cm 

52 

1 

0 

Penetration  of  cortex 

(None 

53 

1 

2 

{  Partial 

54 

1 

2 

(Total 

55 

1 

2 

6.  Proliferation  of  bone 


Location 

Sclerotic  rim 

56 

1 

0 

(  Central 

23 

2 

Mottled 

57 

1 

0 

-j  Eccentric 

24 

2 

Endostotic 

58 

1 

0 

(Cortex  or  surface 

25 

2 

Hyperostotic 

59 

1 

0 

Trabeculated 

60 

1 

0 

Pelvis  or  sacrum 

26 

2 

Buttressed 

61 

1 

0 

Other  flat  bones. 

(No  expanded  cortex 

62 

1 

2 

including  ribs. 

-(Expanded  10  mm 

63 

1 

2 

skull,  mandible 

27 

2 

(Expanded  11  mm  up 

64 

1 

-i 

Small  bones, 

'No  Codman’s  triangle 

65 

1 

2 

including  patella 

28 

2 

1  Codman's 

66 

1 

2 

Tubular  bones. 

' 

2  Codman’s 

67 

1 

2 

including  clavicles 

29 

-  2 

3  or  more  Codman's 

68 

1 

-> 

(No  periostosis 

69 

1 

2 

Subdivisions  of 

-[  Laminated  perio. 

70 

1 

2 

tubular  bones 

Amorphous  perio. 

71 

1 

2 

Articular  surface 

30 

0 

Epiphysis 

31 

0 

'No  spiculation 

72 

1 

•> 

Growth  plate 

32 

0 

Regular 

7? 

1 

-> 

Metaphysis 

33 

0 

1 

Hair-on-end 

74 

1 

-> 

Shaft 

34 

0 

Velvet 

75 

1 

Note:  Within  each  bracketed  group  only  one  item  may  be  checked  yes. 


FIGURE  13 

Questionnaire  for  a  Bayesian  bone  tumor  diagnostic  model. 

From:  Lodwick,  Gwilym,  et  al.  ,  "Computer  Aided  Diagnosis 

of  Radiographic  Images"  Journal  of  Chronic  Diseases 
(Volume  19 r  No.  4,  April,  1966)  p.  489. 
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"prior  distribution"  tables.  The  accuracy  of  the 
system  is  high,  with  greater  than  ninety  percent  agree¬ 
ment  with  the  histological  type  for  nine  cell  categories. 
The  data  analysis  was  tested  using  two  separate  matrices, 
the  first  based  on  data  from  a  Sarcoma  Registry,  the 
literature  and  Lodwick's  personal  file;  the  second, 
employing  greater  input  from  Lodwick's  personal  exper¬ 
ience  and  expectations.  Interestingly,  the  second 
symptom-disease  matrix  yielded  results  five  percent 
better  than  the  first. 

A  criticism  often  levelled  at  computer-dependent 
diagnostic  models  is  their  relative  inaccessibility; 
the  results  frequently  are  not  available  until  after 
the  clinical  diagnosis  has  been  obtained.  The  largest 
Bayesian  study  to  date,  a  model  for  the  diagnosis  of 
abdominal  pain ^ '  Offers  a  solution.  The  study  involves 
an  "on-line"  facility  within  the  Professorial  Surgical 
Unit  at  Leeds,  England,  into  which  the  clinician  "logs" 
the  appropriate  information.  Following  the  Bayesian 
computations,  the  teletypewriter  prints  a  "hardcopy" 
including  a  summary  of  the  diagnostic  findings,  the 
computer's  "diagnoses",  a  list  of  additional  physical 
findings  to  reexamine,  and  finally,  a  listing  of  rarer 
diseases  to  be  included  in  the  differential  diagnosis 
but  too  rare  to  provide  a  sufficient  data  base  for  computer 
analysis . 
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The  symptoms  used  in  the  model  (Figure  14  )  are  the 

standard  ones  employed  in  the  diagnosis  of  the  "acute 

abdomen".  Seven  diagnoses  -  acute  appendicitis, 

diverticular  disease,  perforated  peptic  ulcer,  acute 

cholecystitis,  acute  intestinal  obstruction,  acute 

pancreatitis  and  non-specific  abdominal  pain  -  were 

chosen  for  the  study  since  they  represented  over  ninety- 

five  percent  of  all  admissions  to  this  unit  with  acute 

abdominal  pain.  The  entity,  non-specific  abdominal  pain, 

5  0 

while  not  an  accepted  medical  entity,  played  a  signifi- 

5 1 

cant  role  m  the  study.  DeDombal  has  noted  that 
every  experienced  surgeon  encounters  a  significant  number 
of  patients  with  acute  abdominal  pain  who  fit  none  of 
the  other  disease  patterns.  In  some,  the  pain  subsides 
spontaneously  and  the  patient  is  sent  home  undiagnosed, 
others  come  to  "negative  laparotomy" ,  a  third  group  have 
a  medical  problem  not  requiring  surgical  approach,  for 
example,  a  urinary  tract  infection,  and  still  others  have 
psychophvsiologic  pain.  Thus,  it  is  important  to  have  a 
category  for  those  patients  admitted  to  a  surgical  unit 
with  acute  abdominal  pain  for  whom  no  surgical  diagnosis 
can  be  obtained  and  surgical  intervention  is  inappropriate. 

To  qualify  for  the  study,  the  criteria  were: 

1.  patient  presented  with  a  chief  complaint  of 


abdominal  pain 


SERIAL  NO 

REG45TMATION  NO. 

caxr*Tt>5 

MAC  PRO 

A.  A. 

*“  44 

J.  CLINICIANS  PRE  OP  D 


fig.  3 — Example  of  case  history,  showing  form  on  to  which  patient  data  are 
copied  for  later  entry  into  the  computer. 


FIGURE  14- 

Questionnaire  for  a  Bayesian  abdominal  pain  diagnostic  model 

Horrocks ,  J.C.,  et  al . ,  "Computer-Aided  Diagnosis" 
British  Medical  Journal  (Volume  2,  April  1,  1972)  p. 
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2.  the  pain  was  of  less  than  one  week's 

duration 

3.  the  admission  was  an  emergency  procedure 

through  the  emergency  room 

4.  the  patient  had  not  previously  entered 

into  the  study 

5.  the  patient  was  capable  of  giving  a  history 

(this  excluded  only  two  patients,  a 
baby  aged  two  weeks  and  a  comatose 
patient) 

6.  a  diagnosis  was  eventually  made  (excluding 

one  patient  for  whom  a  histological 
diagnosis  was  indecisive) . 

The  results  of  the  study  are  striking.  Compared  to 
each  clinician,  the  computer  yielded  consistently  better 
results,  as  shown  in  Figure  15  .  The  diagnosis  of  acute 
appendicitis,  the  most  frequent  cause  of  the  acute 
abdomen,  provided  data  of  great  interest.  Of  the  eighty- 
five  patients  with  acute  appendicitis,  the  computer 
diagnosed  84.  The  Senior  clinicians  recognized  only  75 
and  classified  six  into  the  non-operative  category,  often 
delaying  surgery  several  hours.  While  the  computer  erred 
in  diagnosing  nine  additional  patients  as  having  acute 
appendicitis,  the  clinicians  (without  knowledge  of  the 
computer's  diagnosis)  performed  more  than  twenty  negative 


liagnnstic  accuracy 


n.s. 


reg.  senior  computer 

clinician 


Figure  2.  Comparison  of  overall  human  and  computer-aided  diagnostic  accuracy. 


FIGURE  15 

From:  deDombal,  F.T.,  "Surgical  Diagnosis  assisted  by  a 

Computer"  Proceedings  of  the  Royal  Society  of  London 
(Volume  184,  1973)  p.  437. 
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laparotomies,  including  those  nine.  DeDombal  states, 

"We  cannot  resist  pointing  out  two  facts.  First,  the 
cost  of  performing  a  'negative  laparotomy'  and  keeping 
the  patient  in  the  hospital  for  a  week  postoperatively 
is  rarely  less  than  fe200  ($500);  and  the  system  would 

have  obviated  the  need  for  34  such  operations  [in  a 
larger  series].  More  important  perhaps,  diagnostic  delay 
which  results  in  perforation  of  an  appendix  increases 
the  mortality  of  appendectomy  tenfold;  the  system's 

52 

predictions  would  have  helped  to  minimize  such  delay." 


Other  diagnostic  models 

Two  other  diagnostic  models  are  worthy  of  comment 

although  it  is  difficult  to  classify  them  into  the 

previous  categories.  In  a  monograph  entitled  "Diagnostic 

53 

Computers",  Caceres  and  Rikli  describe  an  operational 
computer  program  to  "read"  electrocardiograms  and  propose 
diagnoses.  The  protocol  involves: 

1.  minimization  of  "noise"  from  the  circuitry 

2.  choosing  a  fiducial  point 

3.  determination  of  significant  reference 

points 

4.  pattern  recognition  routines. 


Once  the  wave  patterns  are  recognized,  the  data  is 
"condenced  to  those  values  significant  to  diagnosis  by 
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a  discriminant  function  analysis,"  "  and  tested  through 

multiple  combinations.  For  example,  "the  diagnosis  of 

'acute  diaphragmatic  myocardial  infarction'  is  pursued 

by  checking  for  the  presence  of  a  pre-determined  positive 

value  for  Q  duration,  for  negative  T  waves,  and  for 

55 

elevated  ST  segments".  To  obtain  the  final  EKG  diagnosis, 

"All  of  the  resultant  diagnostic  statements  are  initally 

considered  as  only  tentative.  Those  which  are  redundant 

or  secondary  to  diagnosis  of  greater  strength  are 
56 

discarded" . 

To  compare  the  program  to  a  clinician,  750  routine 

EKGs,  of  which  approximately  twenty-five  percent  were 

normal,  were  presented  to  an  independent  cardiologist 

(one  not  aware  of  the  criteria  employed  by  the  program) 

and  to  the  computer.  There  was  complete  agreement  in 

about  73%  of  the  readings.  In  26%,  clinician  and  computer 

agreed  that  the  tracing  was  abnormal,  but  disagreed  on 

the  nature  of  the  abnormality.  In  0.8  percent  of  the 

cardiograms,  the  computer  read  as  normal  something 

considered  abnormal  by  the  clinician.  The  vast  majority 

of  these  were  felt  to  be  "analog  magnetic  tape  playback, 
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digitilization  and  artifacts". 

The  authors  foresaw  improvement  in  diagnostic  accuracy 
with  technical  refinements  and  additional  arrhythmia 
routines.  Considering  the  estimated  cost  of  between  two 
and  four  dollars  per  reading,  with  improvements  the  program 
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could  be  viable  for  clinical  use. 

Diagnostic  Criteria 

The  diagnosis  of  certain  medical  diseases,  particularly 

collagen  disorders,  is  difficult  because  of  the  variety 

of  presentations  of  the  same  disease.  To  surmount  this 

problem,  it  has  been  necessary  to  establish  diagnostic 

criteria.  At  the  meetings  of  the  American  Rheumatism 

Association  in  1954,  there  was  a  demand  for  precise 

diagnostic  criteria  for  rheumatoid  arthritis.  In  1356, 

5  8 

a  committee  headed  by  Ropes  proposed  criteria  for  the 

classes  of  "definite,  probable  and  possible  rheumatoid 

arthritis."  After  two  years  of  use,  the  lack  of  rigor 
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m  the  criteria  was  criticized  and  revised  criteria 
were  published,  adding  a  fourth  category,  classical 
rheumatoid  arthritis. 

The  criteria  (Figure  16  ) ,  while  controversial,  were 

felt  to  have  a  number  of  benefits.  Most  importantly, 

medical  school  faculty  "have  found  these  criteria  helpful 

as  a  framework  from  which  to  discuss  the  disease.  This 

has  proved  true  even  for  some  . . .  who  disagree  with  the 

criteria  or  some  portion  thereof,  for  they  still  have  a 

framework  from  which  to  operate  and  a  specific  opportunity 

6  0 

can  then  be  made  to  express  differing  points  of  view. " 
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FIGURE  16 

Diagnostic  criteria  for  rheumatoid  arthritis. 

From:  Lusted,  Lee,  Introduction  to  Medical  Decision  Making 

(Springfield,  Illinois,  Charles  Thomas,  1968)  p.  65-67. 
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CHAPTER  III 


Bayes  Theorem 

It  was  in  1763  that  Bayes "'“first  proposed  his  theorem 
of  prior  distributions.  Despite  numerous  onslaughts 
from  the  world  of  mathematics,  it  has  appeared  repeatedly 
in  scientific  literature  and  has  recently  entered  the 
medical  literature,  dealing  with  problems  of  diagnosis 
or  prognosis. 

Prior  to  defining  the  theorem,  it  is  necessary  to 
define  a  number  of  mathematical  terms.  These  appear 
in  Figure  17 . 

In  mathematical  theory,  the  probability  of  an  event 
is  a  number  between  zero  (an  impossible  event)  and  one 
(a  certainty).  Typically,  the  result  lies  between  and 
can  be  defined  as: 

P  (D )  =  N(D)/N, 

the  ratio  of  the  number  of  people  with  the  disease  to 
the  entire  population.  Similarly,  the  probability  of  a 
symptom  can  be  defined  as: 


P  (S) 


N ( S ) /N . 


FIGURE  17 


N 

Number  of  people  in  the  population 

N  (D) 

Number  of  people  in  this  population 
with  disease  D 

N  (S) 

Number  of  people  with  symptom  S 

N  (D  n  S)  or  N  (S  f)  D) 

Number  of  people  with  both  the 
disease  D  and  the  symptom  S 

P  (D) 

Probability  of  disease  D 

P  (S) 

Probability  of  symptom  S 

p(s  no)  or  p  (d  ns) 

Probability  of  both  the  disease  D 
and  the  symptom  S 

P(Sl'S2"**Sn) 

Probability  of  a  set  of  symptoms 

S1 ' S2 ' * ' * ' Sn 

P (D/S) 

Probability  of  disease  D,  given 
symptom  S 

P (S/D) 

Probability  of  symptom  S,  given 
disease  D 

p (d/s1,s2, . . • ,sn) 

Probability  of  disease  D,  given 
a  set  of  symptoms,  S-^,S2,...,Sn 
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The  probability  of  a  symptom  being  present,  given  a 
disease,  is  proportional  to  the  number  of  patients  having 
both  the  symptom  and  the  disease,  and  inversely  to  the 
number  of  patients  having  the  disease,  that  is: 

P  (S/D)  =  N(S  fl  D)/N  (D)  . 

In  the  special  case  N  (D)  =  0,  the  formula  is  invalid 
and  P(S/D)  is  undefined.  Dividing  both  numerator  and 
denominator  by  N  yields: 


P  (S/D) 


N  ( S  fl  D )  /N 
N ( D ) /N 


This  can  be  simplified  to: 

P  (S/D)  =  P  (S  D  D)/P  (D)  (1) 

A  similar  derivation  yields  the  result: 

P  (D/S)  =  P (S  0  D)/P (S) 

Crossmultiplying  yields: 

P  (S  n  D)  =  P  (D)  *P  (S/D) 

P  (S  n  D)  =  P  (S)  •  (P  (D/S) 


Thus  : 


P(D)  x  P  (S/D )  =  P(S)  x  P(D/S) 

In  a  different  form  (excluding  the  case  in  which  P(S)  =  0 

P(D)  *  P  (S/D) 

P  (S) 


P (D/S) 
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This  is  Bayes  Law.  It  states  that  the  probability  of 
a  disease  being  present  is  proportional  to  three  factors: 
directly  to  the  incidence  of  the  disease  as  well  as  the 
probability  of  the  symptoms  within  the  disease,  and 
inversely  to  the  probability  of  the  symptoms  with  the 
population.  It  is  important  to  note  that  no  assumptions 
have  been  made  regarding  the  nature  of  the  distributions 
and  the  inter-  or  independence  of  symptoms  or  diseases. 

Other  than  the  assumption  that  the  denominator  cannot  be 
zero,  it  has  been  demonstrated  that  the  probability  of 
a  disease  being  present  given  a  set  of  symptoms  can  be 
universally  defined  by  three  pieces  of  data  -  the  probability 
of  a  symptom  within  a  disease  and  the  probability  of 
each  the  symptom  and  disease  within  the  population  being 
studied  (a  community,  country  or  clinic  population, 

2 

for  example) .  An  example  offered  by  Mount  and  Evans 
can  illustrate  the  use  of  Bayes  Law.  A  population  of  one 
thousand  persons  is  distributed  with  regard  to  a  symptom 
and  disease  as  follows: 

1.  100  patients  have  both  the  disease  and 

symptom 

2.  200  patients  have  the  disease  but  not  the 

symptom 

3.  300  patients  have  neither  the  disease  nor  the 


symptom 
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4.  400  patients  do  not  have  the  disease  but 

have  the  symptom. 

These  statements  are  equivalent  to: 


1. 

N  (S 

n 

D) 

=  100 

2. 

N  (S 

n 

D) 

=  200 

3. 

N  (S 

n 

D) 

=  300 

4. 

N  (S 

n 

D) 

=  400 

in  which  S  and  D  imply  respectively,  those  without  the 
symptom  and  those  without  the  disease.  It  is  simple  to 
compute : 

P  (D)  -  P  (D  f)  S)  +  P  (D  n  S) 

=  100/1000  +  200/1000 
=  0.3 

and , 

P(S)  =  100/1000  +  400/1000 
=  0.5 

Using  Equation  (1): 

P  (S/D)  =  P (S  n  D)/P (D) 

=  0.1/0. 3 

=  0.33 

To  determine  the  probability  of  a  patient  having  this 
disease,  with  the  knowledge  that  the  patient  has  the 
symptom,  one  can  use  Bayes  Law: 
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P(D/S)  =  P  (D)P (S/D)/P (S) 
=  0.3  x  0.33/0.g 
=  0.2. 


This  value  could  be  derived  by  direct  examination  of  the 
data  with  complete  agreement. 

Diseases  are  not  typically  represented  by  one  symptom 
but  more  likely  by  a  set  of  symptoms,  physical  findings 
or  laboratory  abnormalities.  Symptoms  are  better  seen 
as  complexes.  Instead  of  P(S),  it  is  more  appropriate 
to  employ  the  term  P  (S^ , , . . . , S^)  and  Bayes  Law  becomes: 

P(D/Sl,S2 . Sn)  -  P(D)-P(S1-S2 . VD) 


P  ^S1 ' S2 ' '  ’ *  ' Sn^ 


In  addition,  the  absence  of  a  symptom  is  as  important  to 
consider  as  its  presence.  The  probabilty  of  S ^  -  the 
symptom  being  absent  -  is  equal  to  l-P(Sj). 

In  the  simplest  of  clinical  situations,  one  is  concerned 
with  the  presence  or  absence  of  a  single  disease.  This 
is  equivalent  to  P (D)  +  P (D)  =1,  that  is  ,  the  patient 
either  has,  or  does  not  have  the  disease;  no  other  possi¬ 
bilities  are  tenable.  If  two  or  more  diseases  are  consid¬ 
ered,  the  patient  must  always  have  some  disease  or  be 
normal : 


P(D1)  +  P(d2)  +  •••  +  P  (Dn)  +  P (normal)  =  1. 

For  the  sake  of  simplification,  the  state  normal  is  either 
included  as  an  additional  disease  state  or  ignored. 


55 


The  terms  P(S)  can  also  be  expanded,  to: 


P(S)  =  P(S/D  )P(D  )  +  P(S/D  )P(D  )  +  ...  +  P(s/D  )P(D  ) 
I  -L  12  mm 


Combining  these,  Bayes  Law  becomes: 


P(VS1'S2 


'Sn/Di)P(Di> 


/  •  •  •  r 


P(S1,S2 


p (s1,s2, 

P(SrS2 


/  •  •  •  f 


S  /D9)P(D0)  +  ...  + 
n  z  z 

,Sn/Di)P(Di)  +  ...  + 


P(S1'S2 . VDm)P(V 


still  without  having  made  any  assumptions. 

To  apply  Bayes  Law  to  clinical  situations,  it  is 
necessary  to  know  only  the  following  information: 

1.  the  incidence  (or  prevalence,  if  appropriate) 
of  the  disease  within  the  population  being 
studied,  and, 

2.  the  probability  of  the  symptom  complex  being 
present  within  each  of  the  diseases  under 
consideration . 

The  first  is  accessible  through  the  medical  literature  or 
public  health  data.  If  the  population  under  consideration 
is  a  clinic  group,  the  data  can  be  predicted  from  personal 
experience.  The  second  is  somewhat  more  complex.  Medical 
literature  generally  contains  information  in  the  form 
of  P (S/D)  for  each  individual  symptom.  Only  rarely  does 
the  literature  propose  to  give  P(S^,S2/D),  the  occurence 


of  two  symptom  complexes  within  a  disease.  Even  rarer 
does  it  propose  to  deal  with  more  extensive  symptom 
complexes  for  any  disease. 

To  obtain  sufficient  data  would  require  a  review 
of  a  large  number  of  cases,  proportional  to  m  x  2n  for 
m  disease  and  n  symptoms,  each  having  only  two  possible 
states,  present  or  absent.  As  the  number  of  symptoms 
becomes  large  or  the  states  for  any  one  symptom  increases 
(for  example,  a  symptom  graded  as  absent,  mild,  moderate 
or  severe)  the  task  becomes  untenable:  the  required 
sample  would  quickly  exceed  all  human  population,  past 
or  present. 

To  simplify  this  task,  this  study,  as  well  as  all 
the  Bayesian  studies  in  the  literature,  assumes  the 
independent  sorting  of  symptoms  within  each  disease.  If 
independent,  the  incidence  of  a  set  of  symptoms  can  be 
expanded  to: 

P(S1'S2 . V  "  P(S1)P(S2)...P(Sn) 


and  Bayes  Law  becomes: 

P  (Di/S1,S2, . . . ,Sn) 


P  (Di)P  (S1/Di)P (S2/Di)  .  .  .P  (Sn/Di) 
P(D1)P(S1/D1)P(S2/D1) . . .P(Sn/D1) 

P  (D2)P (S1/D2)P (S2/D2)  . .  .P  (Sn/D2) 

P(Dm)P(Sl/Dm)P(S2Dm)-*-P(W 


Theoretically,  this  cannot  be  true,  as  demonstrated  by 

"Two  symptoms  S.^  and  S2  relevant  to  a  disease  D 


Lusted . 
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cannot  be  independent  of  each  other  unconditionally 
since  the  truth  or  falsehood  about  the  hypothesis  of 
what  disease  causes  and  S will  cause  the  symptoms, 

3 

if  they  are  indeed  diagnostically  relevant,  to  co-vary." 

4 

Warner  dealt  with  this  problem  through  the  mathe¬ 
matical  demonstration  that  for  a  symptom  to  be  truly 
independent  of  another  woud  require  a  uniform  distri¬ 
bution  through  the  population.  Consequently,  the  symptom 
would  be  of  no  diagnostic  value.  The  use  of  non-inde¬ 
pendent  symptoms  is  valid  if  the  correlation  "is  due 
only  to  the  non-uniform  distribution  of  in  diseases 

y-^,y0,...,y^  and  not  due  to  a  direct  causal  relationship 

5 

between  x  and  x,  . " 

a  b 

Lodwick  states  "Our  version  of  Bayes  rule  assumes 
that  our  predictor  findings  are  independent,  which  is 
to  say  that  information  regarding  the  presence  or  absence 
of  one  finding  does  not  provide  information  regarding  the 
probability  of  the  presence  or  absence  of  any  other 
findings.  One  can  at  best  only  demonstrate  correlation 
between  findings  in  a  given  sample.  The  absence  of 
correlation  merely  suggests  the  possibility  of  indepen¬ 
dence;  it  can  bv  no  means  be  accepted  as  proof.  Thus, 

we  would  appear  to  be  on  theoretically  shaky  grounds  to 

6 

use  Bayes  theorem  in  the  form  we  use  it.'1 
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In  practice,  independence  of  symptoms  can  be 
tested  through  chi-squared  analysis.  If  present,  highly 
correlated  symptoms  can  be  combined;  symptoms,  which 
are  causally  related,  can  be  eliminated.  For  example, 
it  would  be  in  error  in  a  congenital  heart  disease 
model  to  employ  both  cyanosis  and  p0?  levels;  their 
correlation,  while  dependent  on  other  factors,  is  too 
high  to  allow  their  simultaneous  use. 

Alternatively,  a  complicated  mathematical  procedure 

7 

can  be  employed.  Brunk  and  Lehr,  using  the  Gram 
Schmidt  Or thoginalization  Procedure,  discard  symptoms 
with  a  high  correlation.  Those  with  weaker  correlations 
selected  individually  and  a  linear  function  of  one  variable 
is  subtracted  from  another  so  that  they  are  no  longer 
correlated.  Then  another  variable  is  selected,  the  same 
function  performed,  until  a  set  of  uncorrelated  variables 
is  obtained. 

Despite  this  limitation,  Lusted  warns  against  ignoring 
Bayes  Law.  "How  seriously  should  you  take  your  inability 
to  prove  that  symptoms  are  independent?  One  answer  is 
that  you  should  not  take  it  seriously  enough  to  discourage 
you  from  using  Bayesian  procedures  to  study  medical  diag¬ 
nosis.  You  can  assume  that  symptoms  are  conditionally 
independent  even  when  you  have  reason  to  suppose  they 
are  not.  ...,  but  you  should  not  take  the  situation  too 
glibly.  Watch  for  violations  of  conditional  independence 
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which  are  so  severe  that  they  may  lead  you  to  major 
errors  in  diagnosis  .  ..."8 


To 

illustrate  the 

use  of 

Bayes  Law, 

a  symptom 

disease 

matrix  is  shown 

below . 

Incidence 

P  (s1) 

P  (S 2 ) 

p(s3) 

D 

0.25 

.  2 

.  3 

.  9 

D 

0.75 

.  7 

.  4 

.  4 

A  patient  under  consideration  has  symptoms  and 

but  not  S2-  Using  Bayes  Law: 

P(D)P(S  ,S  ,S  /D) 

P(D/D  ,S  ,S  )  =  - ±— ± - 

p (d)p (s1,s2,s3/d)  +  p (d)p  (si;s2,s3/d) 

P (D)P (S1/D)P (S2/D)P (S3/D) 

p(d)p(s1/d)p(s2/d)p(s3/d)  + 

P (D) P (S1/D) P (S2/D) P (S3/D) 


Remembering  that  P(S)  is  equal  to  1  -  P(S)  : 


p  (d/s1,s2#s3) 


(.25)  (.2)  ( 1- . 3 )  (.9) 

(.  25)  (.  2)  (1-.  3)  (.  9)  +  (.  75)  (.7)  (1-.4)  (.4) 


=  0.2 


Similarly , 

P  (D/S1,S2,S3)  =  P (D)P (S1,S2,S3/D) 

P  (D)P  (S1,S2,S3/D)  +  P (D)P (S1,S2,S3/D) 

(.75)  (.7)  (1-.4)  (.4) 

'  (.75)  (.  7)  (1-.4)  (.4)  +  (.25)  (.  2)  (1-.3)  (.9) 


0.8 
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Thus,  the  probability  of  the  disease,  given  these 
symptoms,  is  0.2  (twenty  percent)  and  the  probability 
of  the  disease  not  being  present  is  0.8  (eighty  percent). 

One  final  comment  on  Bayes  Law  relates  to  the 
simultaneous  occurrence  of  two  diseases.  The  data  in 
the  symptom-disease  matrix  relates  only  to  each  individual 
disease.  Bayes  Law  does  not  permit  assumptions  regarding 
the  simultaneous  occurrence  of  two  diseases:  this  case 
must  be  handled  as  a  new  disease  pattern.  Neither  a 
ventricular  septal  defect  nor  pulmonary  stenosis  alone 
can  cause  cyanosis,  yet  in  combination  they  are  likely 
to  do  so.  A  Bayesian  model  employing  the  two  diseases 
separately  is  incapable  of  diagnosing  the  combined 
condition:  to  diagnosis  a  VSD  with  pulmonary  stenosis 
would  require  data  on  this  third  disease. 

In  the  following  chapters,  a  model  employing  Bayes 
Law  for  the  diagnosis  of  neonatal  heart  disease  will 
be  described.  In  considering  the  model  and  its  use  of 
Bayes  Law,  it  is  important  to  remember  that  independence 
of  variables  was  assumed  to  allow  one  to  expand  Bayes 
Law  to  its  final  form.  Beyond  this,  Bayes  Law  is  of 
universal  validity:  it  allows  its  user  to  predict  the 

likelihood  of  a  disease  from  a  set  of  symptoms.  The 
only  information  required  for  this  prediction  is  the 
incidence  of  each  disease  under  consideration  and  the 
probability  of  each  symptom  within  each  disease. 
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CHAPTER  IV 


METHODOLOGY 

With  the  development  of  high-speed  electronic 
computers,  it  became  practical  to  apply  Bayes  Law  to 
the  solution  of  complex  clinical  problems.  As  mentioned 
previously,  models  have  been  developed  for  the  diagnosis 
of  congenital  heart  disease,  the  acute  abdomen,  and 
radiological  evaluation  of  bone  tumors.  In  addition, 

studies  have  been  performed  in  the  areas  of  thyroid 

1  2 
disease  and  the  diagnosis  of  Cushing's  syndrome. 

Although  the  area  of  congenital  heart  disease  was 

well  studied  by  Warner,  Veasy  and  Toronto,  their 

diagnostic  model  involved  patients  ranging  in  age  from 

one  month  to  twenty  years.  Another  population,  those 

who  display  their  initial  symptoms  in  the  period  between 

birth  and  one  month  were  excluded.  A  great  number  of 

patients  with  congenital  heart  disease,  particularly 

those  with  the  most  severe  disorders,  are  seen  in  the 

neonatal  period.  More  than  one-fourth  of  all  infants 

with  congenital  heart  defects  die  before  the  age  of 

3 

one  month,  most  within  the  first  week. 

It  has  been  the  purpose  of  this  study  to  apply  the 
techniques  of  Bayesian  analysis  to  cardiological  diagnos 
in  the  newborn  period.  The  area  is  well  suited  to  a 


63 


predictive  model:  diagnosis  is  based  primarily  on 
physical  findings  and  radiographic  and  electrocardio¬ 
graphic  data  rather  than  the  more  subjective  history. 

Also,  as  the  majority  of  patients,  other  than  those 
with  small  PDAs  and  pulmonary  disease,  are  catheterized , 
clinical  diagnoses  can  be  checked  by  objective  data. 

Development  of  a  Bayesian  statistical  model  requires 
a  number  of  steps.  One  must  specify  the  clinical  problem 
and  identify  the  patient  population.  Next,  one  must 
decide  upon  the  relevant  symptoms,  physical  findings 
and  laboratory,  electrocardiographic  and  radiographic 
data  and  prepare  a  comprehensive  list  of  diseases  which 
occur  in  the  population.  The  Bayesian  system  then 
depends  upon  completion  of  the  symptom-disease  matrix: 
data  about  the  incidence  of  each  disease  within  the 
population,  and  the  occurrence  of  each  symptom  within 
each  disease.  Using  a  computer  program  to  tabulate  the 
large  quantity  of  data  under  consideration,  a  number  of 
cases  must  be  analyzed  to  determine  the  validity  of  the 
system. 

The  clinical  problem  in  this  thesis  is  congenital 
heart  disease  in  the  neonate.  The  population  which  has 
been  selected  was  arbitrarily  limited  to  any  patient 
admitted  to  the  Yale-New  Haven  Hospital  Newborn  Special 
Care  Unit.  While  this  may  exclude  a  few  neonates  admitted 
to  a  different  hospital  unit  or  in  rare  cases  may  include 
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patients  beyond  the  first  month  of  life,  it  has  been 
necessary  to  adhere  to  these  limits  since  the  symptom- 
disease  was  constructed  with  regard  to  that  population. 
Each  of  the  other  components  will  be  discussed  in  its 
approach  section. 

Symptoms 

To  assess  the  symptoms  relevant  to  the  diagnosis 

of  congenital  heart  disease,  charts  of  patients  evaluated 

for  cardiac  disease  in  the  neonatal  period  were  reviewed. 

Comparing  these  to  the  diagnostic  workup  suggested  by 

4 

Talner  and  Campbell,  it  was  possible  to  compile  a  list 
of  pertinent  findings.  Historical  data  included  prenatal 
conditions  such  as  infection,  hypertension  or  drug  use, 
length  of  gestation,  and  other  family  history  of  congen¬ 
ital  abnormalities.  On  physical  exam,  in  addition  to 
the  vital  signs,  observations  were  made  on  the  pulse 
quality  (intensity  in  all  extremities),  liver  size, 
autonomic  activity  and  skin  color.  Cardiac  exam  included 
evaluation  of  thrills  or  heaves,  and  auscultation  for 
the  quality  of  the  first  and  second  heart  sound,  the 
presence  of  gallops  or  clicks  and  the  location,  timing, 
quality  and  intensity  of  murmers.  Laboratory  evaluation 
included  hemoglobin,  hematocrit,  acid-base  status  and 
blood  gases  in  room  air  and  with  increased  oxygen  tension. 
A  chest  roentgenogram  permitted  evaluation  of  heart  size, 
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pulmonary  vasculature  and  lung  fields,  possible  chamber 
enlargement,  and  on  rare  occasions,  abnormalities  in  the 
location  of  abdominal  viscera,  the  heart  and  the  aorta. 
Electrocardiographic  data  included  conduction  patterns, 
chamber  hypertrophy,  and  direction  of  electrical  forces. 

Several  criteria  must  be  met  before  these  symptoms 
can  be  employed  in  a  statistical  model.  Limited  by 
the  ability  of  digital  computers  to  handle  only  numerical 
data,  symptoms  must  be  organized  into  discrete  units. 

For  example,  a  white  count,  elevated  or  normal,  or  plantar 
reflexes,  up  or  down,  are  distinct  possibilities.  Find¬ 
ings  which  fall  into  subjective  categories  -  for  example, 
the  degree  of  pain  in  arthritis  -  either  require  special 
handling  or  must  be  omitted.  Modifications  might  involve 
definitions  of  the  gradations.  Pain,  if  strictly  defined 
into  categories  of  absent,  mild,  moderate  or  severe, 
represents  discrete  categores. 

Second,  if  a  symptom  is  to  be  included,  it  must 
have  a  known  relationship  to  the  diagnosis.  For  example, 
a  history  of  maternal  infection  other  than  rubella  may 
increase  the  likelihood  of  congenital  heart  disease, 
but  may  make  no  contribution  to  the  specific  diagnosis. 

Third,  definite  and  consistent  criteria  must  be 
established  for  each  variable.  Cardiac  size  is  difficult 
to  assess  in  the  neonatal  period.  To  use  such  a  symptom 
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requires  establishment  of  specific  standards. 

Fourth,  it  is  important  that  data  be  organized 
in  the  form  it  is  obtained.  For  example,  murmurs  may 
be  defined  as  "loud"  or  "soft"  by  standardized  phono- 
cardiographic  deflection.  If  a  phonocardiogram  is  not 
routinely  obtained,  the  significance  of  loud  and  soft 
may  be  lost. 

Finally,  one  must  remember  Lusted' s  admonition  to 
avoid  related  symptoms.  If  arterial  desaturation  is 
used,  cyanosis  must  be  omitted.  Desaturation,  while  not 
completely  correlated  with  skin  color,  is  both  the  cause 
of  and  highly  related  to  cyanosis. 

With  the  assistance  of  three  experts  in  neonatal 
cardiology,  Dr.  Norman  Talner  and  Dr.  Marie  Browne  of 
the  Department  of  Pediatrics,  and  Dr.  Allan  Simon  of 
the  Department  of  Radiology,  it  was  possible  to  select 
the  general  categories  of  data  which  were  utilized  in 
the  diagnosis  of  neonatal  cardiological  disease.  For 
many  of  the  observations,  the  separation  between  normal 
and  abnormal  was  apparent.  The  location  of  the  aorta, 
if  observed,  could  only  be  left  or  right.  A  patient 
could,  for  all  practical  purposes,  be  only  male  or 
female.  For  other  observations,  particularly  those 
employing  quantification  as  in  pulse  rate,  oxygen 
saturation,  etc.,  it  was  necessary  to  establish  cut-offs 
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between  normal  and  abnormal.  Upon  examination  of  a 
large  number  of  charts,  it  became  apparent  that  certain 
groupings  of  data  occurred  within  each  disease  and 
often  these  groupings  were  consistent  between  diseases. 

For  example,  in  examining  the  data  on  the  age  of  the 
patient  at  initial  presentation,  it  was  readily  apparent 
that  certain  diseases  manifested  early  in  life  -  aortic 
atresia  (hypoplastic  left  heart) ,  persistent  fetal 
pathways,  and  transposition  of  the  great  arteries. 

Others,  such  as  aortic  or  pulmonary  stenosis,  Tetralogy 
of  Fallot  or  Ebstein's  abnormality,  generally  presented 
later  in  the  newborn  period.  The  age  which  separated 
the  early  from  the  late  group  was  approximately  three 
days  of  age:  while  some  patients  with  valvular  atresia 
presented  later  in  life  and  others  with  stenosis  presented 
earlier,  it  was  possible  to  separate  them  best  using  the 
age  three  days. 

For  arterial  desaturation,  three  groups  of  values 
were  apparent:  the  lowest  values  were  present  in  the 
transposition  complexes,  significant  desaturation  was 
present  in  right  to  left  shunts,  and  minimal  or  no  desatur¬ 
ation  was  present  in  other  patients.  Limited  by  the 
availability  of  data,  it  became  necessary  to  group  pC>2 
values  into  only  two  groups,  those  with  a  pC>2  less  than 
45  and  those  above.  In  the  absence  of  blood  gas  values. 
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the  appearance  of  generalized  cyanosis  (not  just 
perioral)  was  taken  to  correlate  approximately  with 
the  lower  range  of  arterial  oxygenation,  an  assumption 
which  could  be  inaccurate  in  the  face  of  variations  of 
hemoglobin  concentrations. 

Certain  data,  while  reported  consistently  in  the 
charts,  was  unable  to  satisfy  the  criteria  established 
previously.  For  example,  the  "quality"  of  the  first 
heart  sound,  while  consistently  noted  in  the  charts, 
did  not  lend  itself  to  use.  The  frontal  axis  of  the 
p-wave,  while  frequently  noted,  was  never  abnormal, 
and  consequently  provided  no  data  base. 

The  greatest  difficulty  arose  in  the  selection  of 

criteria  for  ventricular  hypertrophy.  Great  variation 

exists  in  electrocardiographic  ventricular  patterns 

during  the  newborn  period.  Absolute  criteria  for  hyper- 

5 

trophy  patterns  are  available  in  the  literature,  but 
are  so  restrictive  that  patients  documented  to  have 
hypertrophy  may  not  satisfy  the  criteria.^  To  surmount 
this  difficulty,  the  R  to  S  ratio  in  was  selected 
as  an  indicator  of  the  relative  presence  of  ventricular 
forces.  While  patients  with  an  increased  or  decreased 
R  to  S  ratio  may  or  may  not  be  considered  clinically  to 
have  hypertrophy;  if  the  symptom-disease  matrix  is  organized 
in  the  identical  format  rather  than  in  terms  of  left. 
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or  bi-ventricular  hypertrophy,  operation  of  the  Bayesian 
system  is  not  disturbed. 

Similar  choices  were  made  for  each  of  the  other 

pieces  of  data.  While  controversy  often  arose  and 

certain  omissions  were  necessary  -  for  example,  the 

presence  of  Down's  Syndrome  may  be  very  important  in 

suggesting  an  endocardial  cushion  defect  -  the  data 

selected  included  the  majority  considered  in  the 

clinical  diagnosis.  Another  omission  involves  the 

presence  of  chamber  enlargement  on  chest  roentgenogram: 

a  consulting  radiologist  with  experience  in  neonatal 
7 

angiography  suggested  that  specific  chamber  enlargement 
cannot  be  ascertained  accurately  in  the  newborn  period. 

Echocardiographic  data  have  also  been  omitted.  While 
echos  may  be  very  valuable  in  determining  the  presence 
of  a  valve  or  the  size  of  a  chamber,  the  use  of  echo¬ 
cardiography  is  not  sufficiently  standardized,  nor  is  there 
adequate  data  available  to  incorporate  it  into  a  study  of 
this  kind.  Later,  as  sufficient  data  is  collected,  it 
may  be  possible  to  incorporate  these  findings  into  a 
Bayesian  model. 

To  simplify  the  eventual  computations,  the  symptoms 
were  organized  into  the  format  of  choices,  for  example, 
gestational  age  was  divided  into  premature  or  term,  the 
second  heart  sound  as  single  or  split  and  pulses  as 
normal,  increased  or  decreased.  A  patient  can  have  only 
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one  of  these  options,  or  if  the  observation  was  omitted, 
none . 


Disease  and  Incidence  Data 

A  number  of  series  of  congenital  heart  disease 

appear  in  the  literature:  several  quantify  consecutive 
8 

autopsies,  others  involve  catheterization  or  clinical 
8 

data.  Through  these  and  review  of  one-hundred  ninety 
charts  at  Yale  New  Haven  Hospital,  it  was  possible  to 
compile  the  heart  diseases  of  the  newborn  period.  A 
complete  listing  is  very  extensive,  involving  numerous 
combinations  of  defects,  and  many  rare  abnormalities. 

But,  it  can  be  greatly  simplified  if  defects  are  organized 
collectively;  for  example,  the  presence  of  TGA  (Trans¬ 
position  of  the  Great  Arteries)  in  association  with 
VSD  is  not  dissimilar  from  TGA  with  an  atrial  septal 
defect.  In  both  cases,  the  patients  have  "Transposition 
Complexes"  and  can  be  grouped  together  under  the  diagnosis 
TGA.  Patients  with  a  VSD  in  association  with  either  an 
ASD  or  PDA  frequently  display  symptoms  of  their  predominant 
shunt,  usually  at  the  ventricular  level,  and  are  symptom¬ 
atically  not  unlike  patients  with  a  VSD  alone. 

The  final  selection  of  diseases  (Figure  18 )  reflected 
several  criteria: 

1.  the  disease  must  represent  a  specific  entity 


DISEASES 


1.  Pulmonary  atresia. 

2.  Pulmonary  stenosis. 

3.  Tricuspid  atresia  with  a  VSD. 

4.  Tricuspid  atresia  without  a  VSD. 

5.  Truncus  arteriosus. 

6.  Tetralogy  of  Fallot. 

7.  Ebstein's  disease. 

8.  Patent  ductus  arteriosus. 

9.  Ventricular  septal  defect. 

10.  Endocardial  cushion  defect. 

11.  Transposition  of  the  great  arteries. 

12.  Anomolous  pulmonay  venous  drainage. 

13.  Coarctation  with  a  VSD. 

14.  Coarctation  of  the  aorta. 

15.  Hypoplastic  left  heart. 

16.  Aortic  stenosis. 

17.  Normal. 

18.  Primary  pulmonary  disease. 

19.  Persistent  fetal  pathways. 

20.  Primary  myocardial  disease. 


FIGURE  18 


72 


and  if  it  included  two  or  more  diseases  the 
symptoms  within  each  must  be  similar  (thus, 
coarctation  of  the  aorta  was  separated  into 
those  with  and  those  without  a  VSD) 

2.  The  incidence  of  the  disease  must  be  sufficient 
to  permit  the  accumulation  of  adequate  data  - 
very  rare  diseases  such  as  tri-atrial  heart 
were  omitted. 

3.  where  combinations  of  diseases  resulted  in  a 
new  entity  it  had  to  be  included  separately  - 
for  example  the  combination  of  pulmonary 
stenosis  with  a  ventricular  defect  is  the 
Tetralogy  of  Fallot,  an  entity  distinct  from 
its  components. 

The  list  includes  two  non-cardiac  entities.  "Primary 
pulmonary  disease"  represents  those  patients  with  no 
intrinsic  cardiac  defect  whose  symptoms,  tachypnea, 
tachycardia,  murmurs  or  cyanosis,  are  referrable  to  a 
pulmonary  process.  In  the  category  "normal"  are  patients 
with  neither  anatomic  cardiac  defects  nor  pulmonary 
disease  who  are  evaluated  for  arrythmias,  transient  cyanosis 
or  prematurity.  Myocardial  disease  involves  myocardopathies 
of  metabolic  origin  or  coronary  artery  abnormalities  which 
produce  myocardial  ischemia. 


' 
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Series  from  the  literature  were  inappropriate  for 
use  as  incidence  data.  Neither  autopsy  series  nor 
catheterization  data  correlated  exactly  with  the 
population  under  consideration.  The  most  appropriate 
data  was  obtained  from  records  maintained  by  the  Section 
of  Pediatric  Cardiology.  The  "Regional  Infant  Care 
Program"  log  included  all  patients  catheterized  or 
autopsied  during  a  several  year  period  and  for  a  brief 
period,  all  the  patients  consulted  by  the  service, 
whether  or  not  an  anatomical  diagnosis  was  ever  obtained. 
Through  analysis  of  these  records,  it  was  possible  to 
determine  the  incidence  of  each  of  the  defects  over  a 
several  year  period.  The  incidence  of  "normal", 

"primary  pulmonary  disease"  and  PDAs,  three  diagnoses  for 
which  patients  were  frequently  not  catheterized,  was 
computed  from  the  brief  period  when  all  consults  were 
recorded.  Data  on  the  remaining  disease  was  tabulated 
from  the  complete  series  of  catheteriztion  or  autopsy 
patients . 

Symptom-Disease  Matrix 

To  complete  the  symptom-disease  matrix,  three 
sources  were  employed.  First,  one-hundred  ninety  charts 
of  patients  previously  evaluated  by  the  pediatric 
cardiologists  were  examined.  Second,  the  literature 


was  reviewed,  but  little  of  data  existed  in  the  format 

required  by  a  Bayesian  analysis.  The  sex  distribution  - 

male  to  female  ratio  for  each  disease  -  varies  little 

between  sub-populations  and  can  be  abstracted  from  the 
9 

literature.  Some  data  on  PR  and  QRS  intervals  appeared 

10 

in  a  text  on  electrocardiographic  data.  Radiographic 
data  on  aortic  arch  location  was  readily  available  from 
the  literature  in  a  format  applicable  to  this  study."*’’*’ 

The  third  source  was  the  three  clinicians  mentioned 
previously.  Data  on  history  and  physical  findings  was 
obtained  from  Dr.  Talner.  Electrocardiographic  and 
ausculatory  findings  were  contributed  by  Dr.  Browne. 
Radiographic  data  was  provided  by  Dr.  Simon. 

With  these  three  sources  of  data,  it  was  possible 
to  perform  the  initial  crude  analyses.  Reviewing  the 
one-hundred  ninety  cases,  and  comparing  each  to  the 
three  sets  of  data,  a  final  symptom  disease  matrix  was 
compiled  which  appears  in  Appendix  3  .  a  casual 

glance  at  the  data  will  reveal  some  apparent  errors. 

Many  of  the  symptom  complexes  total  greater  than  100% 
resulting  form  the  absence  of  0.00  as  an  entry  anywhere 
in  the  matrix.  Instead  the  value  0.01  or  greater  is 
employed  as  a  "default"  value  for  a  symptom  generally 
absent  from  a  disease.  For  others,  symptoms  felt  to  be 
distinct  clinically  were  combined:  for  example,  heart 
size,  while  recorded  as  normal,  moderately  or  markedly 
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enlarged,  is  dealt  with  statistically  as  normal  or 
enlarged.  For  other  symptoms,  all  the  choices  are  given 
an  equivalent  value  -  .50  or  fifty  percent.  This 
reflects  the  lack  of  significance  of  these  variables. 

For  example,  the  presence  or  absence  of  a  third  heart 
sound  may  be  important  to  a  clinical  diagnosis.  However, 
the  absence  of  sufficient  data  and  the  difficulty  of 
auscultation  of  a  third  heart  sound  eliminated  it  as 
statistically  significant.  In  neonatal  cardiology, 
or  in  any  area  of  medical  diagnosis,  the  differences 
between  observers  may  play  an  important  role  in  diagnosis. 
For  example,  the  difficulty  of  the  auscultatory  exam  of 
a  tachycardic  newborn  may  yield  spurious  data.  If  the 
second  heart  sound  is  reported  as  single  the  clinician 
is  more  prone  to  such  diagnoses  as  valvular  atresia, 
transposition  or  truncus  arteriosus  rather  than  a  septal 
defect  or  patent  ductus.  A  more  careful  examination  may 
reveal  the  presence  of  two  components  to  that  sound, 
suggesting  a  different  group  of  diagnoses. 

Clinicians  could  minimize  the  extent  of  observer 

variation  and  its  effect  on  diagnosis  were  it  not  for 

12  13 

many  studies  such  as  one  by  Yerushalmy  in  the  area  of 

radiology.  A  group  of  physicians  were  presented  with 

roentgenograms  which  they  were  to  evaluate  for  the 

presence  of  pulmonary  tuberculosis.  The  interpreters 

were  expected  to  read  the  films  by  their  usual  criteria; 
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none  were  defined.  All  the  clinicians  missed  approxi¬ 
mately  twenty-five  percent  of  the  positive  films.  When 
the  films  were  reviewed  by  the  same  observers  several 
months  later,  the  same  reader  was  apt  to  change  his 
diagnosis  for  about  one  of  every  five  positive  cases. 

Of  particular  importance  to  neonatal  cardiological 
diagnosis  is  the  evaluation  of  pulmonary  vasculature. 

The  lung  fields  could  demonstrate  a  number  of  patterns 
including  decreased,  normal  and  increased  flow,  and 
a  pattern  of  " f low-f ailure"  or  venous  obstruction. 
Observers  generally  classify  the  vasculature  into  one 
of  these  categories  with  such  qualifieds  as  "markedly" 
or  "upper"  or  "lower  limits".  Different  clinicians 
will  review  the  films,  both  with  and  without  clinical 
data,  and  make  different  observations.  In  Arnois' 
study  ^  three  physicians  from  a  Department  of  Cardiology 
and  Radiology,  all  experienced  in  congenital  cardiac 
disease,  were  shown  128  films  on  three  occasions.  The 
films  were  selected  so  that  32  demonstrated  decreased 
and  32  increased  vascularity,  all  documented  by  catheter 
ization;  the  remaining  64  were  normal.  The  clinicians 
were  shown  the  masked  lung  fields  on  two  occasions; 
on  the  final  viewing  the  masking  was  removed.  "The 
great  majority  of  errors  was  made  in  differentiating 
between  normal  and  decreased  vasculature  ....  In  contra 
distinction,  the  differentiation  between  normal  and 
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increased  vasculature  was  good.  An  infrequent  but 
interesting  error  consisted  in  the  scoring  of  decreased 
vascularity  as  increased.  It  is  of  particular  interest 
that  this  error  occurred  only  in  cases  of  the  Tetralogy 
of  Fallot.  The  converse  -  scoring  of  increased  vascu¬ 
lature  as  decreased  -  did  not  occur  in  this  study. 

The  total  number  of  errors  in  the  group  of  patients 

with  pulmonary  undervascularity  was  136  out  of  288 

15 

readings  or  47  percent." 

These  data  would  suggest  that  differentiation 
between  normal  and  decreased  flow,  while  important 
clinically,  is  a  difficult  radiographic  observation. 

For  the  purposes  of  this  study,  no  attempt  has  been 
made  in  the  data  base  to  employ  the  distinction.  If 
a  clinician  observed  decreased  flow  it  is  placed  in 
the  category  of  decreased-normal  flow.  Similarly, 
the  distinction  between  increased  flow  and  "flow-failure" 
was  difficult  and  these  categories  were  combined. 

The  evaluation  of  murmurs  is  similarly  difficult. 
Feinstein,  in  the  Prologue  to  Clinical  Judgment,  describes 
his  encounter  with  observer  variation  while  directing 
a  rheumatic  fever  prophylaxis  study: 

Soon  after  I  began  this  new  work,  while  making 
ward  rounds  one  day,  I  heard  a  faint,  but 
unequivocal  diastolic  murmur  along  the  left 
sternal  border  of  a  patient  in  whom  no  murmurs 
had  been  noted  either  by  the  resident  physician 
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of  our  hospital  or  by  the  physicians  who 
had  just  referred  the  patient  to  us.  After 
I  demonstrated  the  murmur  to  our  resident 
physician,  he  agreed  that  it  was  there  and 
that  he  had  failed  to  recognize  it. 

The  correction  of  the  resident's  error 
was  a  simple  event  -  part  of  the  ordinary 
daily  routine  of  clinical  activities  and  train¬ 
ing.  Yet,  as  I  later  thought  about  the  event, 
it  assumed  greater  significance.  Since  the 
resident  physician  and  I  might  have  been  the 
only  two  doctors  who  were  going  to  listen  to 
this  patient's  heart  at  our  institution  the 
murmur  would  have  been  undetected  had  I  not 
found  it.  The  patient  would  have  been  dis¬ 
charged  with  the  same  diagnosis  of  "no  heart 
disease"  with  which  whe  had  arrived.  If  the 
murmur  persisted,  it  would  probably  be  found  at 
some  later  date  by  another  auscultator.  Since 
the  murmur  would  never  have  been  cited  previously, 
however,  the  new  auscultator  might  falsely  conclude 
that  the  murmur  had  arisen  from  insidious  scarring 
of  an  aortic  valve  whose  damage  has  previously 
been  clinically  imperceptible. 

The  insidious  development  of  scarring  in 
valves  that  initially  seemed  undamaged  has  long 
been  regarded  as  a  major  pathogenetic  mechanism 
in  rheumatic  heart  disease.  I  began  to  wonder 
how  many  patients  might  have  developed  such 
"scarring"  as  a  fallacy  of  clinical  auscultation. 

De  novo  rheumatic  heart  disease  -  absent  on  initial 
examination  of  a  rheumatic  patient  and  found  in 
another  examination  some  time  later  -  might  certainly 
occur  by  insidious  scarring  of  a  valve,  but  could 
also  be  "created"  by  iatrogenic  mechanisms:  the 
abnormal  murmur  might  have  been  present  on  both 
ausculatory  examinations  although  undetected  initially; 
or  it  might  have  been  absent  on  both  examinations, 
but  erroneously  diagnosed  as  present  on  the  second. 

The  whole  concept  of  insidious  de  novo  rheumatic 
scarring  seemed  to  depend  on  clinical  auscultation 

of  the  heart,  and  yet  auscultators  could  sometimes 
,  16 
be  wrong. 


In  reviewing  charts  of  patients  evaluated  for 
congenital  heart  disease,  it  was  apparent  that  murmurs 
were  recorded  differently  by  different  observers.  Beyond 
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the  highly  subjective  grading  of  loudness,  differences 
involve  type  -  holosystolic  versus  ejection  -  as  well  as 
location  -  upper  versus  lower  versus  entire  sternal 
border,  for  example.  Allowance  for  this  type  of  variation 
required  significant  changes  in  the  matrix.  For  example, 
a  left  upper  sternal  border  ejection  murmur  is  not 
characteristic  of  a  PDA.  In  several  patients  in  the 
series,  this  was  the  murmur  recorded  by  the  clinician. 

If  the  symptom-disease  matrix  contained  the  value  0.00 
for  the  incidence  of  that  murmur  with  a  ductus,  Bayes 
Law  could  provide  a  probability  of  zero  percent  for 
that  diagnosis.  To  avert  this  situation,  the  "default" 
value,  for  each  entry  is  never  set  at  zero,  but  at  least 
0.01,  unless  the  presence  of  a  symptom  if  pathognomonic 
of  the  absence  of  the  disease.  The  level  of  the  default 
value  will  reflect  the  vagueness  of  the  symptom. 

Murmurs  are  particularly  difficult  to  evaluate  in  this 
population  and  ejection  murmurs  at  the  left  sternal 
border  are  so  frequently  reported  that  they  are  of  little 
diagnostic  significance.  Thus,  the  default  value  for 
ejection  murmurs  at  the  left  sternal  border  must  be 
set  high,  0.05,  so  that  their  presence  eliminates  no 
diagnoses.  Diastolic  murmurs  are  less  frequently 
recorded  so  that  the  default  level  has  been  set  at  two 
percent . 


■ 


80 


Even  the  evaluation  of  differential  pulses,  while 
more  precise,  must  allow  for  some  observer  or  disease 
variation.  For  several  patients  in  this  series,  differ¬ 
ential  pulses  were  recorded  by  at  least  one  observer 
but  at  catheterization,  no  cardiovascular  abnormality 
would  be  demonstrated  to  explain  the  finding. 

The  use  of  default  value  increases  the  total  for 
several  symptoms  to  greater  than  1.00.  While  theoretic¬ 
ally  impure,  this  introduces  no  strain  to  a  Bayesian 
system.  To  eliminate  the  increased  totals,  one  need 
only  normalize  the  values  for  each  symptom,  i.e.,  divide 
each  entry  by  the  total  for  that  symptom  in  that  disease. 
In  practice,  the  denominator  of  Bayes  Law  acts  as  a 
normalizing  factor  and  the  use  of  totals  greater  than 
one  introduces  minimal  error  to  the  final  diagnosis. 

Data  Collection 

Patient  data  for  the  testing  of  the  Bayesian  system 
was  collected  at  three  stages.  The  first  group  of  one- 
hundred  ninety  cases  were  selected  from  consecutive 
listings  of  the  Regional  Infant  Care  Program.  The 
second  group,  fifty  patients,  was  obtained  by  further 
review  of  these  listings.  An  additional  thirty  charts 
were  requested  for  these  two  samples  but  were  not 
included  in  the  study.  In  several  cases,  the  patient 
expired  either  prior  to  arrival  at  the  hospital  or  before 
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a  complete  evaluation  by  the  hospital  staff.  For  the 
remainder,  the  charts  were  unobtainable.  Some  had 
been  lost,  while  others  had  been  signed  out  to  hospital 
staff  as  long  as  six  months  earlier  and  never  returned. 
The  diagnoses  of  these  patients  were  obtained  from  the 
files  of  the  Section  of  Pediatric  Cardiology.  As  the 
distribution  of  diagnoses  was  similar  to  that  of  the 
remainder  of  the  population,  it  was  felt  that  omission 
of  these  patients  would  not  compromise  the  study. 

The  charts  of  the  patients  were  abstracted  into  a 
form  similar  to  the  one  shown  in  Appendix  1  .  If 
available,  the  evaluation  by  the  most  senior  cardiologist 
who  saw  the  patient  prior  to  catheterization  was  employed 
In  the  few  cases  where  the  cardiologists  did  not  enter 
a  complete  note  prior  to  catheterization  or  death,  the 
most  senior  housestaff's  evaluation  was  used.  If  in  the 
chart  or  the  files  maintained  by  the  cardiologists,  the 
electrocardiogram  was  observed  for  the  pertinent  findings 
When  absent,  this  information  was  obtained  from  the 
clinician's  note.  In  some  cases,  the  clinicians  reported 
only  the  QRS  axis  and  ventricular  patterns  so  that  data 
on  PR  and  QRS  intervals  was  omitted  from  the  analysis. 
Approximately  half  of  the  chest  roentgenograms  were 
reviewed  by  the  author  with  assistance  from  Dr.  Simon. 

For  those  films  which  were  unobtainable,  the  clinician  s 
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observations  were  employed. 

Greater  than  ninety  percent  of  these  patients 
had  undergone  catheterization  so  anatomical  diagnoses 
were  generally  available.  For  those  patients  not 
cathed,  clinical  diagnoses  were  recorded.  It  is  signif¬ 
icant  to  note  that  these  patients  represent  a  sub¬ 
grouping  of  the  population  under  consideration.  Since 
the  majority  had  been  catheterized ,  the  sample  is 
skewed  towards  the  more  serious  defects  requiring 
diagnostic  study,  and  away  from  the  patients  with  PDAs 
or  other  less  serious  heart  defects. 

Computer  Program 

The  large  number  of  calculations  required  by  Bayes 
Law  would  discourage  the  user  from  performing  the  analysis 
manually.  Through  the  use  of  a  digital  computer,  the 
several  thousand  logical  and  mathematical  steps  can  be 
performed  in  seconds  and  at  minimal  cost.  The  programming 
for  this  research  was  performed  at  the  Yale  Computer 
Center.  Except  for  some  initial  data  manipulation 
employing  SPSS  (Statistical  Package  for  the  Social 
Sciences  17)  ,  all  programming  was  in  the  Fortran  IV 

TO 

Language,  an  IBM  package  for  the  manipulation  of 
numberical  data.  Computations  were  performed  on  the 
IBM  370/158  using  data  card  input  and  printed  output. 
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The  cost  of  the  program  is  minimal.  Using  the  most 
comprehensive  output  format,  a  "run"  averages  $0.80. 

Of  this,  $0.25  is  a  fixed  "job  charge"  and  approximately 
$0.45  is  spent  reading  the  input  deck  and  the  symptom- 
disease  matrix.  Less  than  $0.10  is  actually  spent 
reading  in  the  patient  symptom  information,  performing 
the  thousands  of  logical  and  mathematical  computations, 
and  printing  the  output.  The  cost  of  reading  the  deck 
and  matrix  could  be  decreased  through  the  use  of  binary 
input.  Even  the  use  of  an  on-line,  interactive  program 
would  probably  amount  to  less  than  $1.00  per  day. 

The  program  involves  three  steps.  First,  the 
symptom-disease  matrix  is  read  into  the  computer, 
followed  by  data  on  one  or  more  patients.  Next,  the 
computations  are  performed  using  Bayes  Law.  The 
results  -  the  probability  of  each  of  the  twenty  diseases 
occurring  with  the  given  set  of  symptoms  -  are  then 
printed . 

The  actual  progarm  is  shown  in  Appendix  .  The 
majority  of  steps  are  either  input-output  instructions 
or  "book-keeping",  i.e.,  instructions  which  are  necessary 
only  to  the  computer  program  and  have  no  bearing  on  the 
actual  Bayesian  analysis.  The  remainder  of  the  program 
consists  of  the  instructions  for  the  calculations  of 
Bayes  Law.  As  shown  in  Chapter  III,  Bayes  Law,  with  the 
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assumption  of  conditional  independence  of  variables, 
can  be  written  as: 


P  (D. )P  (S  /D. )  (1-P (S  /D. )  )  . .  .P (S  /D. ) 

P  (D./S  ,S  ,  .  .  .  ,S  )  =  - - - - - - - - - - - ^ - i_ 

n  P(D1)P(S1/D,)(1-P(S./D1))...P(S  /DJ  + 
111  11  n  1 

P  (D2)P  (S  /D2)  (1-P  (S2/D2)  )  .  . .P (Sn/D2)  +  .  .  .+ 

P  (D  )P(S,/D  )  ( 1-P  (S  o/0  ))...P(S  /D  ) 
m  r  m  2'  m  n'  m 


Since  the  symptoms  and  the  symptom-disease  matrix  were 
organized  into  choices,  S ^  (symptom  S ^  being  absent) 
would  not  occur.  Instead,  some  other  symptom,  possibly 


S •  i  or  S .  , ,  would  corresoond  to  the  absence  of  symptom 
j+1  j-1 

S..  Bayes  Law  now  reverts  to: 


P  (D. )P (S  /D. )P (S  /D  )  . . .P  (S  /D. ) 

P(D./S1#S3 . S  )  = - i ^ ^ - 

P(D1)P(S1/D1)P(S3/D1)...P(Sn/D1)  + 

P  (D2)P  (Sj/D^P  (S3/n2)  .  .  .P  (Sn/D2)  +. . .+ 
P(Dm)P(Sl/Dm)P(S3/Dm)-*-P(VDm) 


As  noted  in  previous  sections,  the  matrix  is  in  the  form 
of  probabilities , while  the  data  on  each  patient  or  case 
is  in  the  form  of  ones  or  zeroes,  i.e.,  the  patient  either 
has  or  does  not  have  the  symptom.  To  determine  the 
probability  of  each  disease,  one  needs  to  calculate  both 
the  numerator  and  denominator.  Fortunately,  the  denominator 
is  identical  for  every  disease  for  a  given  patient,  and 
is  in  fact,  the  sum  of  all  the  numerators.  A  great  savings 
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is  accomplished  by  calculating  each  of  the  twenty 
numerators  separately,  finding  their  total  and  dividing 
each  by  the  total.  This  yields  results  in  the  form 
of  probabilities.  To  express  the  results  as  percentages, 
each  probability  is  multiplied  by  one  hundred  percent. 
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CHAPTER  V 


RESULTS  AND  DISCUSSION 

Of  the  three  populations  studied,  the  first  (190 
patients)  was  employed  in  the  construction  of  the  symptom- 
disease  matrix  so  evaluation  of  their  results  would  be 
inappropriate.  The  matrix  had  been  constructed  previous 
to  the  analysis  of  the  second  group  (50  patients)  so  a 
statistical  evaluation,  while  fraught  with  the  difficul¬ 
ties  present  in  any  retrospective  chart  review,  is 
valuable.  The  third  population,  the  twenty  patients 
evaluated  prospectively,  represents  the  "target"  popula¬ 
tion.  While  small  in  number,  this  is  the  ideal  group 
for  statistical  review. 

The  data  analysis  for  each  patient  was  printed 
in  a  format  similar  to  the  one  shown  in  Appendix  4.  The 
clinical  (pre-catheterization)  diagnosis  is  printed  along 
with  the  computer's  "predicted-diagnosis"  and  a  differential, 
any  diagnosis  which  received  greater  than  a  one  percent 
probability  in  the  Bayesian  analysis.  The  choice  of 
one  percent  as  the  limit  for  a  diagnosis  to  be  considered 
significant  was  an  arbitrary  one  arrived  at  after  review 
of  the  initial  one-hundred  ninety  patients.  With  knowledge 
of  the  operation  of  a  Bayesian  system,  one  learns  that 
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simple  shifts  in  the  data  base  can  produce  significant 
changes  in  the  predicted  likelihood  for  any  individual 
disease.  Ignoring  a  diagnosis  with  a  probability  of 
1.5%  could  limit  the  value  of  a  Bayesian  analysis,  a 
program  which  does  not  "diagnose"  a  clinical  situation, 
but  rather  predicts  the  likelihood  of  the  given  set  of 
symptoms  in  each  of  the  disease  under  consideration. 
Theoretically,  a  situation  could  arise  in  which  each  of 
the  twenty  diseases  could  enter  into  the  differential 
diagnosis  with  a  probability  of  greater  than  one  percent, 
and  the  Bayesian  analysis  would  have  done  little  to  clarify 
the  diagnostic  situation.  In  practice,  the  differential 
diagnosis  generally  approximates  five  diseases,  on 
occasion  it  was  as  large  as  nine  and  in  several  instances 
only  one  diagnosis  received  greater  than  a  one  percent 
probability . 

In  reviewing  the  data  for  the  original  cases,  an 
additional  problem  became  apparent.  The  analysis  had 
difficulty  differentiating  the  categories  normal  and 
primary  pulmonary  disease.  This  reflects  the  similarity 
of  data  for  the  two  categories.  In  addition,  the  majority 
of  patients  who  are  diagnosed  as  normal  are  evaluated  for 
tachypnea  or  cyanosis,  both  of  which,  if  non-cardiac,  are 
most  likely  of  pulmonary  origin.  The  inability  of  the 
program  to  differentiate  these  categories  has  required 
that  they  be  combined.  The  new  category  is  entitled 


. 
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"non-cardiac  disease" .  Data  provided  in  the  analyses 
consists  of  the  total  of  the  two  separate  categories 
rather  than  a  separate  entity. 

For  the  fifty  patients  studied  retrospectively, 
the  Bayesian  "predicted  diagnosis"  agreed  with  the 
catheterization  or  final  clinical  diagnosis  in  fifty- 
two  percent.  For  nine  (18%) ,  the  clinical  diagnosis 
received  the  second  greatest  percentage  in  the  analysis, 
and  for  an  additional  three  (6%),  the  clinical  diagnosis 
corresponded  to  the  third  most  likley  computer  diagnosis. 
In  almost  ninety  percent  of  the  cases,  the  clinical 
diagnosis  was  listed  in  the  computer's  differential 
diagnosis,  i.e.,  in  only  twelve  percent  of  the  cases 
did  the  actual  diagnosis  receive  less  than  a  one  percent 
probability.  Employing  a  more  conservative  cut-off  of 
two  percent  would  increase  the  percentage  of  total 
omissions  to  sixteen  percent  and  a  highly  conservative 
limit  of  five  percent  would  increase  it  to  eighteen 
percent.  The  distribution  of  clinical  versus  predicted 
diagnoses  is  shown  in  Figure  19.  Only  three  of  the 
nineteen  diagnoses  were  predicted  far  more  often  than 
they  occurred.  Tricuspid  atresia  with  a  VSD  occurred 
four  times  in  this  series  and  was  predicted  six  times. 
Coarctation  of  the  aorta  occurred  twice  and  was  predicted 
twice  as  often.  Persistent  fetal  pathways  was  present 
in  only  one  patient  but  it  was  predicted  in  two  others. 
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FIGURE  19 

Predicted  versus  actual  diagnosis  for  50  retrospective  cases 
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In  contrast,  the  Tetralogy  of  Fallot  occurred  in  six 
patients  but  was  predicted  only  once.  For  the  remainder, 
the  frequency  of  clinical  and  computer  diagnoses  was 
similar,  within  plus  or  minus  one. 

For  the  twenty  patients  studied  prospectively, 
the  Bayesian  predicted  diagnosis  agreed  with  the  final 
clinical  or  catheterization  diagnosis  in  sixty  percent 
of  the  cases.  In  three  cases  (15%)  the  actual  diagnosis 
was  given  the  second  greatest  percentage  by  the  computer 
program  and  in  one  (5%)  the  actual  diagnosis  entered  the 
computer's  differential  as  the  third  most  likely.  In 
only  one  of  the  twenty  cases  was  the  computer  unable  to 
give  the  appropriate  diagnosis  the  one  percent  probability 
required  for  entry  into  the  differential  diagnosis. 

This  was  a  premature  male  (Patient  1,  Appendix  4) 

with  certain  unusual  data  not  considered  by  the  program. 

In  addition  to  the  abnormal  findings  of  a  single  second 

heart  sound,  an  ejection  murmur  at  the  left  lower  sternal 

border,  and  left  axis  deviation,  the  chest  roentgenogram 

revealed  the  presence  of  situs  inversus  with  levocardia, 

1 

a  combination  associated  with  specific  abnormalities 
Using  the  clinical  data,  the  program  yielded  two  likely 
diagnoses,  a  VSD  (38.4%)  and  a  patent  ductus  (29.8%). 

On  reevaluation  of  the  patient  one  week  later  (prior  to 
catheterization)  the  program  suggested  either  a  VSD  (35.8%) 
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or  an  abnormality  of  the  right  ventricular  outflow  tract 
(pulmonary  atresia  or  pulmonary  stenosis) .  In  neither 
did  the  analysis  yield  greater  than  the  one  percent 
probability  for  the  correct  diagnosis  -  an  endocardial 
cushion  defect. 

The  second  cushion  defect  in  the  prospective 
series  illustrates  an  omission  in  the  symptom  list. 

The  patient's  clinical  data,  a  full-term  female  with 
hepatomegaly,  an  ejection  click,  a  systolic  murmur 
at  the  left  sternal  border,  a  northwest  electrocardio¬ 
graphic  axis,  cardiomegaly  and  increased  pulmonary  blood 
flow,  is  shown  in  Appendix  4,  patient  2.  The  predicted 
diagnosis,  a  ventricular  septal  defect,  was  considered 
five  times  as  likely  as  the  actual  diagnosis,  an  endo¬ 
cardial  cushion  defect.  In  addition  to  the  two  findings 
of  increased  pulmonary  flow  and  a  northwest  axis  which 
were  highly  suggestive  to  the  clinicians  of  the  actual 
diagnosis,  the  patient  also  displayed  Down's  Syndrome, 
making  the  clinicians  virtually  certain  of  their  diagnosis. 
The  program,  not  taking  into  consideration  the  presence 
of  Down's  Syndrome  and  its  association  with  cushion 
defects,  utilized  incidence  data  which  penalized  cushion 
defects  strongly  (0.014  for  ECDs  and  0.113  for  VSDs)  and 
subsequently  was  unable  to  give  the  appropriate  results. 

Another  error  occurred  in  a  full-term  male  (Patient 
3)  who  was  evaluated  at  birth.  Clinically  the  patient 
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displayed  only  tachypnea,  but  the  chest  roentgenogram 
demonstrated  borderline  cardiomegaly  and  a  venous 
obstructive  pattern.  Using  the  data,  the  program 
suggested  a  patent  ductus  (50%)  and  gave  the  correct 
diagnosis,  non-cardiac  disease,  less  than  a  two  percent 
probability.  Clinically  this  infant  of  a  diabetic 
mother  displayed  no  evidence  of  a  ductus  other  than 
an  abnormal  chest  roentenogram  and  the  x-ray  findings 
were  attributed  to  the  maternal  condition.  The  computer 
program,  lacking  appropriate  data  on  infants  of  diabetic 
mothers  was  unable  to  ascertain  the  appropriate  diagnosis. 

For  several  other  patients,  the  program  not  only 
provided  the  correct  diagnosis  but  gave  a  differential 
diagnosis  which  very  much  resembled  the  clinical  differ¬ 
ential.  For  Patient  4,  a  term  male  with  hepatomegaly, 
cyanosis,  normal  ausculatory  exam  and  normal  chest 
roentgenogram,  the  program  gave  a  great  likelihood  to 
both  non-cardiac  disease  (68.4%)  and  persistent  fetal 
pathways  (24.4%),  the  two  major  components  of  the  clinical 
differential.  The  final  diagnosis  was  non-cardiac  disease, 
which  agreed  with  the  predicted  diagnosis. 

Another  term  male.  Patient  5,  was  noted  to  have 
cyanosis,  increased  pulses,  a  single  second  heart  sound, 
an  ejection  murmur  at  the  left  upper  sternal  border,  a 
normal  cardiogram  and  a  chest  roentgenogram  demonstrating 
increased  heart  size  and  a  venous  obstructive  pattern. 
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Three  diagnoses  -  a  VSD  (28.6%), PDA  (27.0%)  and  truncus 
arteriosus  (23.0%)  entered  prominently  into  the  compu¬ 
ter's  differential.  The  clinicians  suspected  a  PDA 
because  of  the  pulse  intensity  and  apparently  briefly 
considered  the  other  two  diagnoses  suggested  by  the 
computer.  At  catheterization  a  truncus  was  demonstrated. 

For  another  patient,  the  computer's  diagnosis, 
while  incorrect,  was  similar  to  the  initial  clinical 
impression.  A  term  male,  Patient  6,  with  decreased 
pulses,  cyanosis,  two  murmurs  and  a  normal  chest  roent¬ 
genogram  was  considered  clinically  to  have  either  a 
myocardial  disorder  or  persistent  fetal  pathways.  The 
computer  strongly  suggested  persistent  fetal  pathways 
(60.7%)  and  suggested  only  a  slight  probability  of  a 
cardiomyopathy  (2.2%).  At  catheterization  a  complex 
defect  was  found  which  fell  into  the  transposition  complex, 
a  diagnosis  which  the  computer  had  given  its  second 
greatest  probability  (19.0%)  but  had  not  entered  promi¬ 
nently  into  the  clinical  diagnosis. 

Several  additional  cases  are  displayed  in  Appendix  4. 
These  demonstrate  both  correct  and  incorrect  predictions 
by  the  program. 

Further  evaluation  of  the  results  might  include 
examination  of  the  probabilities  yielded  by  the  computer, 
or  to  compare  them  to  clinicians'  predictions.  In  Warner's 


96 


study,  a  method  was  formulated  to  compare  clinicians' 
predictions  to  those  of  the  computer.  He  created  a 
"Diagnostic  Performance  Index"  which  is" 

DPI  =  P  x  F 

where  P  is  the  mean  probability  assigned  to  the  correct 
diagnosis,  and  F  is  the  fraction  of  cases  in  which  the 
correct  diagnosis  was  given  probability  greater  than 
one  percent.  As  his  clinicians  organized  their  differ¬ 
ential  diagnosis  lists  in  a  format  similar  to  that  of 
the  computer,  including  estimating  probabilities,  Warner 
was  able  to  provide  meaningful  comparisons  between 
clinical  and  computer  diagnoses.  In  the  present  study, 
the  clinicians  did  not  estimate  probabilities  for  their 
predicted  diagnoses,  and  rarely  discussed  the  relative 
likelihood  of  different  elements  of  their  differential. 
Thus,  it  is  difficult  to  make  statistical  comparisons 
between  the  clinician  and  computer. 

In  addition,  it  is  important  to  not  place  too  great 
a  value  on  the  actual  probabilities  suggested  by  the 
computer  program.  For  example,  one  patient  in  the 
retrospective  series,  a  premature  female  (Patient  12) 
with  normal  pulses,  a  holosystolic  murmur  at  the  left 
lower  sternal  border,  cardiomegaly  and  increased  pulmonary 
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flow  was  felt  clinically  to  have  a  ventricular  septal 
defect.  Two  diagnoses,  a  PDA  (49.7%)  and  a  VSD  (49.4%) 
entered  the  computer's  differential.  At  catheterization, 
a  ductus  and  an  intact  ventricular  septum  were  demon¬ 
strated.  To  fault  the  computer  for  not  assigning  a 
greater  probability  to  the  appropriate  diagnosis  would 
be  in  error.  The  program  responded  to  the  data  it  was 
given  by  suggesting  that  two  diagnoses  -  a  PDA  and  VSD  - 
were  almost  equally  likely.  In  other  cases,  the  low 
probability  assigned  to  the  appropriate  diagnosis  does 
not  imply  that  the  diagnosis  is  unlikely,  but  rather, 
that  other  diagnoses  must  be  seriously  entertained  in 
the  differential.  As  the  number  of  possible  diagnoses 
increases,  the  probability  assigned  to  each  must  neces¬ 
sarily  decline. 

In  spite  of  the  absence  of  data  comparing  the 
clinicians'  predictions  to  those  of  the  computer,  some 
observations  can  be  made.  For  sixteen  of  the  twenty 
patients  (80%)  the  clinicians'  original  diagnosis  agreed 
with  their  final  diagnosis  compared  to  the  computer's 
sixty  percent  performance.  For  two  patients  (10%),  the 
actual  diagnosis  entered  into  the  clinician's  prediction 
but  was  not  considered  to  be  the  most  likely  diagnosis. 
For  two  patients  (10%) ,  the  actual  diagnosis  was  given 
a  very  low  probability  by  the  clinicians.  In  addition, 
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there  were  several  instances  where  the  clinicians,  in 
examining  the  computer's  complete  differential,  felt 
that  it  was  not  only  more  complete,  but  also  more  logical, 
than  their  own. 

Through  review  of  the  twenty  prospective  cases, 
it  has  become  apparent  that  there  are  a  number  of 
weaknesses  in  the  data  base,  the  symptom-disease  matrix. 
First,  there  are  great  errors  in  the  incidence  data. 

The  category  "non-cardiac"  disease  was  given  only  a 
seven  percent  incidence  in  the  matrix  but  was  the  final 
diagnosis  for  thirty-five  percent  of  the  patients  in 
the  prospective  series.  This  is  not  surprising:  it 
was  known  that  the  retrospective  series  was  heavily 
skewed  towards  the  more  seriously  ill  patients  and 
included  few  patients  with  pulmonary  disease  or  who 
were  normal.  The  incidence  of  non-cardiac  disease  was 
estimated  from  data  which  supposedly  included  all  patients 
consulted  by  the  service,  whether  or  not  catheterization 
was  performed.  In  retrospect,  it  is  most  likely  that 
many  patients  with  non-cardiac  disease  whose  evaluations 
were  minimal  were  not  entered  into  the  Regional  Infant 
Care  Program  records.  Consequently,  the  incidence  data 
for  non-cardiac  disease  is  in  error.  Similarly,  PDAs, 
a  diagnosis  which  is  frequently  made  without  invasive 
techniques,  comprised  twenty-five  percent  of  the  prospec¬ 
tive  cases  in  contrast  to  14.7%  incidence  in  the  matrix. 
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Two  of  the  patients  in  the  prospective  series 
had  endocardial  cushion  defects,  yielding  an  incidence 
seven  times  that  of  the  symptom-disease  matrix.  While 
both  these  patients  were  catheterized. ,  it  is  likely  that 
the  diagnosis  of  a  cushion  defect  is  frequently  made 
without  catheterization.  In  patients  with  Down's  Syndrome 
and  a  northwest  electrocardiographic  axis,  the  diagnosis 
may  be  readily  apparent.  The  presence  of  Down's  Syndrome 
may  therefore  serve  as  a  contraindication  to  aggressive 
diagnostic  study.  Alternatively,  in  a  larger  series  of 
prospective  cases,  no  additional  endocardial  cushion 
defects  might  appear  and  the  incidence  data  could  more 
closely  approximate  that  of  the  matrix. 

The  incidence  of  many  of  the  remaining  diagnoses 
did  not  coincide  with  the  predictions.  There  was  only 
one  ventricular  septal  defect  (5%)  compared  to  an 
incidence  figure  of  11.3%,  and  two  transposition  complexes 
(10%)  compared  to  a  prediction  of  18.9%.  Hypoplastic 
left  heart  never  entered  the  prospective  series,  while 
it  was  predicted  to  have  a  9.9%  incidence  in  the  matrix. 
Several  possibilities  exist  to  explain  these  variations. 
First,  the  distribution  of  congenital  heart  disease  may 
be  shifting  due  to  changing  patterns  of  pre-natal  care 
or  abortions.  Second,  it  is  difficult  to  make  statistically 
valid  observations  of  the  distribution  of  nineteen  disease 
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categories  in  twenty  cases.  It  is  also  interesting 
to  note  that  historically,  certain  diagnoses  have  occurred 
in  clusters:  following  a  period  when  the  diagnosis  of 
hypoplastic  left  heart  was  never  made,  several  patients 
were  noted  in  rapid  succession  to  have  that  disease. 

The  effect  of  unknown  variables,  possibly  uterine  viral 
infections  or  drugs,  cannot  be  ignored  in  these  distri¬ 
bution  variations. 

To  assess  the  effect  of  the  incorrect  incidence 
data  on  the  program,  the  prospective  series  was  reanalyzed 
with  the  incidence  data  omitted,  i.e.,  giving  each 
disease  an  incidence  of  five  percent.  The  results 
were  generally  better  than  when  the  incidence  data  was 
employed.  The  percentage  of  correct  diagnoses  increased 
to  seventy-f ive ,  one  diagnosis  was  assigned  the  second 
greatest  probability  and  one,  the  third.  None  of  the 
differential  diagnoses  omitted  the  correct  diagnosis. 

The  lowest  probability  assigned  to  the  correct  diagnosis 
was  4.1%.  By  omitting  the  incidence  data  from  the 
analysis,  although  one  is  violating  Bayes  Law,  the  analysis 
yields  the  comparison  of  the  patient's  symptoms  to  those 
present  in  each  disease,  ignoring  the  relative  frequency 
of  each  disease.  In  two  cases  described  previously, 
this  variation  yielded  interesting  results  shown  in 
Appendix  5.  The  predicted  diagnosis  for  Patient  5 
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became  truncus  (77.7%)  which  was  the  catheterization 
diagnosis.  Patient  2  was  appropriately  diagnosed  as 
having  an  endocardial  cushion  defect  (29.6%).  For 
several  of  the  patients,  those  with  diagnoses  of 
transposition  or  PDA,  this  variation  decreased  the 
probability  assigned  to  the  correct  diagnosis,  but  in 
none  of  these  cases  did  it  change  its  location  in  the 
differential  diagnosis.  These  observations  would 
suggest  two  possibilities.  First,  the  incidence  data 
for  the  program  contains  serious  errors  which  should 
be  revised  before  the  program  can  be  used  effectively. 
Second,  they  suggest  that  a  variation  of  Bayes  Law: 

P  (D/S )  cc  P  (S/D) 

may  have  statistical  significance.  This  formula  is 
equivalent  to  the  observation  that  a  disease  is  more 
likely  if  the  symptoms  which  are  present  resemble  those 
likely  in  the  disease,  even  if  the  disease  is  known  to 
occur  rarely.  This  is  similar  to  the  clinician  making 
his  diagnosis  in  the  absence  of  incidence  data,  or 
employing  incidence  data  in  crude  forms  such  as  rare, 
frequent,  etc.  An  error  potentially  introduced  by  this 
variation  might  be  to  overdiagnose  rare  diseases. 

Other  errors  have  also  become  apparent  during  the 
testing  of  the  matrix.  For  example,  the  occurrence 
of  decreased  pulses  in  Ebstein's  disease  was  set  at  the 
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default  level.  Subsequently,  it  has  become  apparent 
to  the  author  that  decreased  pulses  are  probably  common 
in  Ebstein's  disease  and  the  value  should  be  set  higher. 
The  data  on  the  distribution  of  murmurs  similarly 
contains  errors.  Ejection  murmurs  at  the  left  upper 
sternal  border  were  frequently  reported.  In  many  cases, 
the  computer  employed  this  to  give  a  disproportionately 
high  value  to  aortic  stenosis.  Although  ejections 
murmurs  at  that  location  are  consistent  with  aortic 
stenosis,  the  data  must  be  changed  to  accomodate  the 
frequency  with  which  that  murmur  is  reported  in  associa¬ 
tion  with  other  diagnoses.  Other  errors  in  electrocardio¬ 
graphic  data,  and  roentgenographic  findings  have  become 
apparent  and  are  being  revised  for  further  testing  of 
this  program.  One  piece  of  data,  the  occurrence  of 
cyanosis  in  the  "non-cardiac  disease"  population  was 
surprisingly  accurate.  Initially,  the  probability  was 
set  at  seventy-five  and  sixty  percent  for  the  "normal" , 
"primary  pulmonary  disease"  populations  respectively. 

In  the  prospective  series,  of  the  seven  patients  in  the 
"non-cardiac"  category,  four  displayed  cyanosis. 

A  third  group  of  errors  involves  the  omission  of 
valuable  entries  from  the  symptom  list.  As  noted,  the 
omission  of  Down's  Syndrome  may  create  difficulty  in  the 
diagnosis  of  endocardial  cushion  defects.  With  the 
collection  of  appropriate  data,  it  would  be  possible  to 
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incorporate  this  finding.  Other  symptoms  originally 
included  in  the  program  but  ignored  in  the  data  analyses 
are  liver  size,  pulse  and  respiratory  rate,  gallops 
and  ejection  sounds  and  the  direction  of  T  in  in 
patients  four  days  or  older.  Each  was  omitted  because 
of  insufficient  data  or  the  observation  that  data  grouped 
too  closely  around  the  cut-off  points.  For  example, 
the  respiratory  rate  in  this  population  is  so  frequently 
reported  as  seventy  per  minute  that  tachypnea  appeared 
to  be  of  little  value  as  a  discriminating  factor.  As 
data  is  collected  prospectivly  and  a  larger  data  base 
is  obtained,  it  will  be  possible  to  incorporate  each 
of  these,  and  possibly  other  symptoms  and  signs. 

Finally,  errors  introduced  by  the  assumption  of 

independence  of  variables  must  be  considered.  While 

this  assumption  has  not  yet  appeared  to  influence  the 

results,  errors  could  theoretically  occur.  For  example, 

if  the  presence  of  Down's  Syndrome  is  introduced  as  a 

variable,  the  incidence  data  is  no  longer  independent. 

For  patients  with  Down's  Syndrome,  there  is  a  distinct 

3 

distribution  of  congenital  heart  abnormalities,  m 
contrast  to  the  rarer  occurrence  of  cushion  defects  in 
the  remainder  of  the  population.  Findings  whose  simul¬ 
taneous  occurrence  have  more  than  a  multiplicative 
effect  on  the  diagnosis,  would  similarly  defy  the  assump- 


104 


tion  of  independence.  However,  as  a  Bayesian  analysis 
becomes  unwieldly  in  the  absence  of  this  assumption, 
it  is  necessary  despite  theoretical  limitations. 

Despite  these  errors,  the  program  has  yielded 
results  which  frequently  resemble  the  actual  diagnosis. 
Although  comparison  to  the  clinician's  predictions  is 
difficult,  the  program  does  not  appear  to  be  signifi¬ 
cantly  less  accurate.  With  the  introduction  of  better 
data  it  may  be  possible  to  employ  the  program  as  an 
effective  adjunct  to  clinical  diagnosis.  The  need  for 
pediatric  cardiologists  would  not  be  lessened  through 
the  use  of  the  program.  Clinicians  are  still  needed 
to  make  the  clinical  observations,  perform  the  auscul¬ 
tatory  exam,  read  the  roentgenograms,  perform  the 
catheterization,  etc.  The  program  is  not  capable  of 
recognizing  gross  inconsistencies  in  the  data.  Also, 
the  computer  makes  no  recommendations  regarding  therapeutic 
intervention,  surgery,  etc.  It  is  only  capable  of 
predicting  the  likelihood  of  diseases  from  a  given  set 
of  symptoms. 
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CHAPTER  VI 


BEYOND  BAYES 

In  the  previous  two  chapters,  the  methodology 
and  results  of  a  Bayesian  diagnostic  model  have  been 
presented.  It  has  been  demonstrated  that  a  Bayesian 
model  is  capable,  despite  the  recognized  limitations 
in  the  data  base,  to  assemble  a  differential  diagnosis 
which  is  appropriate  to  the  clinical  situation,  and 
often  assigns  the  appropriate  diagnosis  the  greatest 
probability.  The  utility  of  such  a  model  is  great  and 
a  few  of  the  implications  are  to  be  presented  in  this 
chapter . 

The  value  as  an  adjunct  to  clinical  diagnosis  is 
apparent.  The  Leeds  model  for  the  diagnosis  of  acute 
abdmonial  pain  has  demonstrated  that  the  accuracy  of 
a  Bayesian  model  can  exceed  that  of  clinicians  at  many 
levels  of  training.  Other  studies  by  Warner  and  Lodwick 
have  shown  that  Bayesian  models  can  yield  results  similar 
to  skilled  clinicians. 

The  opportunity  for  a  Bayesian  model  to  influence 
a  skilled  clinician  in  his  preparation  of  a  differential 
diagnosis  must  be  considered.  Warner  has  noted  that  the 
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two  clinicians  involved  in  his  studies  increased  their 
own  diagnostic  accuracy  markedly  during  the  study.  "The 
extent  to  which  the  improved  performance  of  the  physicians 
is  the  result  of  experience  in  preparing  data  for  and 
receiving  feedback  from  the  computer  over  this  period 
of  time  is  difficult  to  evaluate.  It  is  interesting  in 
this  regard  that  observer  (AFT),  who  improved  the  most, 
had  the  most  direct  contact  with  the  computer  results 
over  the  period  of  this  study. Alternatively,  the 
improvement  may  have  resulted  from  the  clinician  being 
forced  to  collect  a  complete  data  base. 

The  Leeds  group  has  also  applied  their  data  to 

2 

teaching  situations.  Through  the  use  of  random 
number  charts  and  a  desk-top  computer,  artificial  case 
histories  can  be  created  for  which  the  student  can 
assemble  a  differential  diagnosis.  In  light  of  the 
known  accuracy  of  this  particular  Bayesian  model,  it 
can  be  instructive  to  the  student  to  compare  his  own 
diagnosis  with  the  computers.  Regretably,  the  accuracy 
of  the  diagnosis  cannot  be  compared  to  a  real  patient, 
but  "there  are  isolated  occasions  (such  as  when  a  patient 
scheduled  for  bedside  teaching  goes  home  or  refuses 
permission)  when  a  series  of  artificial  substitutes 
might  be  useful.  Indeed,  (we)  have  found  our  series 
quite  useful  on  occasion  -  not  so  much  for  the  cases 
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themselves  as  for  the  subsequent  discussion  with  the 

students,  to  whom  the  concept  of  'certainty'  in  diagnosis 

3 

is  often  new  and  intriguing." 

An  important  implication  of  Bayes  Law  is  the 
ability  to  examine  components  of  the  diagnostic  workup. 
Having  compiled  and  tested  a  symptom  disease  matrix, 
it  is  possible  to  omit  an  individual  symptom  or  finding 
from  the  analysis  and  determine  the  effect  on  diagnosis. 
If  it  is  found  to  have  no  effect  the  utility  of  that 
symptom  for  the  diagnosis  of  that  disease  is  low.  A 

4 

mathematical  approach  to  this  problem  proposed  by  Warner 
involves  evaluation  of  the  "information  content"  of  a 
symptom  for  a  disease.  This  "may  be  defined  as  the 
logarithm  of  the  ratio  of  the  probability  that  symptom  x^ 
is  present  or  absent  in  any  patient  from  this  population. 

I  =  In (P (x1/y1) /P (x1) ) 

Now  this  information  can  be  either  positive  or  negative 
since  the  probability  of  a  particular  symptom  in  a 
given  disease  may  be  greater  or  less  than  the  incidence 
of  that  symptom  in  the  group  of  diseases  under  study. 
However,  if  the  information  content  is  defined  as  the 
absolute  value  of  the  logarithm  of  this  ratio,  a  number 
is  obtained  which  is  independent  of  the  sign  of  the 
measure.  The  average  information  content  (I)  of  a  given 
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symptom  for  a  set  of 


I  = 


k  diseases  can  be  obtained  from: 
In  P(x1/yk)  -  In  P(x1)  | 


Now  since: 


p(yk/xi> 

p(yk) 

. . . ,  the  term  e1  is  the  average  factor  by  which  the 

calculated  probability  of  a  disease  is  changed  by 

finding  symptom  to  be  present  or  absent  in  the 

patient  to  be  diagnosed.  This  term  then  is  a  direct 

measure  of  the  average  value  of  that  particular  symptom 

5 

in  diagnosing  this  group  of  diseases." 

Warner  proposes  three  criteria  for  the  inclusion 
of  a  symptom  in  a  Bayesian  model:  ^ 

1.  A  symptom  should  be  one  whose  presence 
or  absence  can  be  accurately  recognized. 

2 .  A  symptom  should  be  independent  of  other 
symptoms  in  any  given  disease. 

3.  A  symptom  should  have  an  information  content 
greater  than  1.0. 

Of  these,  the  third  offers  the  greatest  opportunity  in 
the  application  of  Bayes  Law  to  medical  diagnosis. 
Applying  these  calculations  to  a  clinical  problem 
enables  one  to  determine  the  information  content  of 


P  (x1) 
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each  symptom:  those  with  very  low  information  contents 
could  be  omitted  from  the  diagnostic  workup.  Warner 
notes  that  vague  symptoms  such  as  fatigue  have  a  low 
information  content  in  his  congenital  heart  disease 

7 

model,  and  these  were  omitted  from  subsequent  analyses. 

In  a  more  sophisticated  model,  the  information  content 
of  a  symptom  can  be  compared  to  its  cost.  Those  symptoms 
with  a  low  information  content  to  cost  ratio  could  be 
omitted.  Ultimately,  Bayesian  analysis  offers  a  tech¬ 
nique  to  structure  a  medical  workup  to  optimize  results 
while  minimizing  costs  and  inconvenience  for  the  patient. 
Another  use  of  Bayes  Law,  in  sequential  processing 

g 

and  optimization,  comes  from  models  offered  by  Gorry 
9  .  . 

and  Ginsberg.  In  every  clinical  situation,  the 
clinician  proceeds  in  a  sequential  manner,  collecting 
initial  data,  formulating  a  clinical  impression,  collecting 
additional  data,  revising  the  differential  and  so  on. 

"The  value  of  information  obtained  from  a  test  is 
determined  by  the  contribution  that  this  information 
makes  to  improving  the  current  view  and  hence  reducing 
the  risk  of  misdiagnosis  with  its  associated  cost.  The 
more  information  the  doctor  obtains  about  the  patient, 
on  the  average,  the  less  risk  of  a  possible  misdiagnosis. 
Hence,  the  doctor  is  inclined  to  perform  many  tests.  On 
the  other  hand,  the  tests  available  to  him  are  not  without 
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some  cost  in  terms  of  patient  discomfort,  time  of  skilled 
persons,  money,  etc.  Thus,  there  is  a  conflicting 
tendency  to  hold  the  number  of  diagnostic  tests  to 

•  •  „io 

a  minimum. 

Gorry,  in  a  reanalysis  of  the  data  collected  by 
Warner,  proposes  to  evaluate  a  patient  for  congenital 
heart  disease  in  a  sequential  pattern.  The  clinician 
might  observe  the  age  of  the  patient  and  check  for 
cyanosis  and  murmurs.  He  may  next  obtain  an  EKG  or 
perhaps  he  would  choose  a  chest  roentgenogram.  Gorry 
optimizes  this  decision  by  determining  which  single 
test  is  most  likely  to  clarify  the  diagnostic  question. 

An  initial  description  of  the  patient  is  fed  to  the 
computer  and  a  set  of  Bayesian  probabilities  determined. 
Then,  each  of  the  other  symptoms  is  entered  separately 
into  the  program,  along  with  the  "cost"  of  the  test 
(in  this  case,  all  equivalent)  and  the  seriousness  of 
misdiagnosis  for  each  disease  (again,  all  equivalent  in 
this  case) .  The  test  which  optimizes  the  results  is 
requested  and  the  calculations  performed  again.  When 
the  introduction  of  additional  information  no  longer 
contributes  to  the  certainty  of  the  diagnosis  and  the 
"cost"  of  misdiagnosis  is  minimized,  the  program  termi¬ 
nates.  Using  sequential  processing,  the  program  yields 
results  statistically  comparable  to  Warner's.  Of  signifi- 
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cance  is  the  program's  use  of  fewer  than  seven  tests 
on  the  average,  compared  to  the  thirty-one  required 
by  Warner's  initial  progarm. 

Ginsberg  in  his  decision  analysis  model  for  the 
pleural  effusion  syndrome,  introduces  a  number  of 
variables  which  the  clinician  uses  but  have  been 
omitted  from  earlier  Bayesian  models.  The  cost  of 
tests,  the  time  delays  and  risks  involved  in  ordering 
complex  tests  or  laboratory  procedures  and  the  value 
of  further  diagnostic  intervention  are  all  considered. 

At  each  point  in  the  analysis,  the  program  optimizes 
its  decisions.  If  a  lab  test  or  procedure  adds  little 
to  the  diagnosis  in  light  of  the  present  clinical 
picture,  it  is  not  requested  by  the  program.  In  contrast, 
any  inexpensive  but  valuable  screening  procedure  such 
as  a  tuberculin  test,  enters  early  into  this  type  of 
model.  Through  the  introduction  of  parameters  of  cost, 
risk  and  time  delay  in  clinical  diagnosis,  despite  the 
different  values  placed  on  these  factors  by  different 
clinicians,  it  is  possible  to  expand  statistical  decision 
analysis  to  more  complex  medical  problems. 

The  role  which  Bayes '  .Law  and  computational  devices 
will  play  in  the  future  of  medical  diagnosis  depends 
upon  physician  acceptance.  The  opportunities  can  be 
well  illustrated  by  an  analogy.  "The  clinician  often 
uses  a  stethoscope  to  augment  his  ability  to  hear  sounds 
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emanating  from  within  a  body  cavity.  Sometimes  the 
clinical  picture  is  clearcut;  in  this  instance  the 
clinician  merely  uses  his  stethoscope  to  confirm  his 
previous  assessment  of  the  patient.  Sometimes  the 
results  which  the  clinician  obtains  from  the  use  of  the 
stethoscope  are  difficult  to  interpret  or  are  at  odds 
with  what  the  clinician  'feels'  about  the  case  -  in 
such  circumstances  the  clinician  is  at  liberty  to  dis¬ 
regard  the  evidence  from  his  'machine'.  But  in  a 
proportion  of  cases  the  evidence  the  clinician  obtains 
will  alter  his  impression  of  the  case  sufficiently  to 
make  him  seek  additional  evidence  and  this  in  turn  will 
lead  him  to  the  correct  diagnosis."’*’^ 

So  it  is  with  a  Bayesian  program.  By  helping  to 

sort  through  large  quantities  of  data,  an  area  in  which 

12 

physicians  are  known  to  be  relatively  weak  ,  the 
computer  can  provide  suggestions  which  may  aid  the 
clinician  in  arriving  at  the  appropriate  diagnosis. 
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DO  99  1=2,74 

99  PEA0(5,98)(AFt*AY(I»J)  »  J  =  1  » 2  0  ) 

98  FOOMAT ( 7X.20F3.2 ) 

RE  AD (5, 150)  DOC 

150  FORMAT ( 5X, 3A4) 

R. E  A 0  <  5  ,  l  51  )  'OESC 

151  FORMAT (  5  X,  15A4I 
REAC  (5,  152)  CIS 

152  FCRMAT(5X,1 l A4) 

114  R  E ADI  5 ,  l 01, £60=  509 )L UNI T »C A  St ,NA  , MON , DATE , YE  A P , NL DX , NT OX , NAM c 

101  eCRMAT(T6,lX,53F1.0/7X,21F1.0/7X,12,3(lX,A2),2(lX,I2),lX,5A4) 
IF  (NLOX.EQ.O)  N  LO  X  =2 1 

IF  (NTDX.EQ.O)  NT  DX=  2  1 

WP ITE(  6 , 500) NAMF ,LUNIT,  (DOC ( I , N A ) , I  =  1 , 3 ) , MOM, D AT  £ , Y F AR 

500  FORMAT (  •  1  • / / •  PATIENT:  ',5A4//‘  UNIT  «:  ',16//'  CLINICIAN: 

*  '  «  3  A4  /  /  •  DATE:  •  ,  A2,  •  /  •  ,  A2,  •  /  '  ,  A2// '  S  YMPT  ;3M  S :  '  /  ) 

DO  501  1=2,74 

IF  (CASE  (  I  ) . £0.0 )  00  TO  501 
WP.  I  TF  (  6 , 502  )  (  nE  SC  (  K  ,  I  )  ,  K  =  1 , 15  ) 

502  FORMAT ( 5  X,  15A  4) 

501  CONTINUE 

WR I TE ( 6 , 701  I  ( DI S  (  L  ,  N  L  P  X  ) ,1  =  1,11) 

701  FORMAT!//'  CLINICAL  DIAGNOSIS:  '//5X, 11A4) 

WP I TE ( 6 , 503 ) 

503  FORMAT (  /  / '  PREDICTED  DIAGNOSIS:'/) 

C  IF  VARIABLES  ARE  TO  BF  OMITTED,  CASE!  )=0  MUST  BE  DECLARE)  NCW. 

case ( n  =i.o 

CASE( 19  )  =0.0 
CASEI20) =0.0 
CASE(29)=0.0 
CASE  (30) =0.0 
C  A  S E ( 3  3 )  =0.0 
CASE! 3  4 ) =  0 . 0 
CASE  (64 ) =0.0 
CASE ( 65  I =0.0 
TOT  AL  =  0  •  0 
DO  103  J=1 ,20 
BAYES! J I =1.0 
DC  102  1=1,74 

IFICASclD.EO.O.O)  GO  TO  102 
BAYES( J) =  B A YF  $ (J*» ARRAY! I,J) 

102  CONTINUE 

103  TOTAL  =  TOTAL«-PAYES.(  J) 

00*104  J  =  1 , 20 


104  PR  OB  I  J)  =  1 00’’  B  AY  r  S  (  J) /TOTAL 
PROS ( l 7) =pcnp ( 17 ) +PPOB ( 18 ) 

PROB (18 ) =0.0 

K  =  1 

DUMM  Y=  PP  0  B (  1) 

00  400  1=2,20 

IF  (PROBU  ).LE.  DUMMY)  GO  TO  400 

oummy=prob( i i 
K  =  I 

400  CONTINUE 

WRITE16,401)(DIS(L,K),L=1,11),PR0B(K) 

401  EOPMAT(5X,11A4,IOX,'  ( •  ,F4. 1  ,  'XI  '  I 
PPOB (K ) =0.0 

WR ITF( 6, 504) 

504  FORMAT!//'  OTHER  POSSIBLE  DIAGNOSES:'/) 
00  508  M=1,19 

K=1 

OUMMY=PRDB( 1 ) 

DO  450  1=2,20 

IF  ( PROB  ( I  I . L  F. DUMMY)  GO  TO  450 
DUM  MY  =  P R  C8 ( I ) 

K=I 

450  CONTINUE 

IF  (PROB(K) .LT.1.0)  GO  TO  505 

WR !TE( 6, 401) (TT  S( L,K )  ,L  =  1  ,11)  ,PROB(K) 

508  PF  OB  (  X 1  =  0.  0 
M=M  +  1 

505  IF  (M.NE.l)  GO  TO  506 
WRITE  (6,507) 

507  FORMAT! *  NONE • ) 

506  GO  TO  114 

509  CC  NT  T  NUE 
END 


D°  . 


I  ' 


’ 

. 

' 


APPENDIX  3 


Symptom-Disease  Matrix 
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APPENDIX  4 


Sample  Patients 


PATIENT:  1 

CLINICIAN:  DR.  RESIDENT 

DATE:  11/  6/75 

SYMPTOMS: 

LE  36  WEEKS  GESTATIGN  • 

LE  3  CAYS  OLD 
MALE 

PULSE  RATE  GT  170 
NORMAL  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
L  I  VHP  LE  2  C  M  BELOW  CM* 

NO  DIFFERENTIAL  CYANOSIS 
RESPIRATORY  RATE  LE  70 
CYANOSIS 
PH  LE  7.29 

5 2  SINGLE 

53  ABSENT 

54  ABSENT 

EJECTION  CLICK  PRESENT  ^ 

NO  DIASTOLIC  MURMER 

SYSTOLIC  MUR  MEF  AT  THE  LEFT  LOWER  STERNAL  BORDER 

PP  INTERVAL  LE  0.11 

ORS  INTERVAL  LE  0.06 

AXIS  27C-360  OR  0-59 

R/S  IN  VI  LE  10  AND  GE  0.8 

INCREASED  PULMONARY  BLCCD  FLOW  W/0  VENOUS  OBSTRUCTION 
MODERATELY  INCREASED  HEART  SIZE 


CLINICAL  DIAGNOSIS: 

ENDOCARDIAL  CUSHICN  DEFECT 


PINAL  DIAGNOSIS: 


ENDOCARDIAL  CUSHION  DEFECT 


PREDICTED  DIAGNOSIS: 


VENTRICULAR  SEPTAL  DEFECT  (38.4*) 

OTHER  POSSIBLE  DIAGNOSES: 

PATENT  DUCTUS  ARTERIOSUS  (29.8?) 

TRANSPOSITION  OF  THC  GREAT  ARTERIES  (14.0%) 

TRICUSPTD  ATRESIA  WITH  VSD  (  7.9%) 

COARCTATION  WITH  A  VENT  SEPTAL  DEFECT  (  2.8%) 

HYPOPLASTIC  LEFT  HEART  (  2.4%) 

PULMONARY  ATRESIA  (  1.6%) 

TRICUSPID  ATRESIA  WITH  INTACT  VENT.  SEPTUM  (  1.5%) 


r 


PATIENT:  2 

CLINICIAN:  CR.  GLANZ 

OATE:  10/26/75 

SYMPTOMS: 

GE  37  WEEKS  GESTATION 
GE  4  DAYS  OLD 

female 

0 UL  SE  PATE  GT  170 
NOFMAL  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
LIVER  GT  2  CM  BELCW  CM 
RESPIRATORY  RATE  GT  70 
NO  CY ANOS  IS 
PH  GE  7.30 
S  2  SPLIT 

53  ABSENT 

5 4  ABSENT 

EJECTION  CLICK  PRESENT 
NO  DIASTOLIC  VUFMEP. 

SYSTOLIC  MijEMEP  AT  THE  LEFT  LOWER  STERNAL  BORDER 
SYSTOLIC  MUPMEP  AT  THE  LEFT  UPPER  STERNAL  BORDER 
P D  INTERVAL  LE  O.II  ‘ 

ORS  INTERVAL  LE  0.06 
AXIS  181-269 

R/S  IN  VI  LE  1C  AND  GE  C.  8 
T  IN  VI  GT  0  IN  PATIENT  AGE  GT  3  DAYS 

INCREASED  PULMONARY  BLOOD  FLOW  W/0  VENOUS  OBSTRUCTION 
MODERATELY  INCREASED  HEART  SIZE 
LEFT  AORTIC  APCH 


CLINICAL  DIAGNOSIS: 

ENDOCARDIAL  CUSHION  DEFECT 


FINAL  DIAGNOSIS: 

ENDOCARDIAL  CUSHICN  DEFECT 


PREDICTED  CIAGNOSIS: 


VENTRICULAR  SEPTAL  DEFECT  (51.61) 

OTHER  POSSIBLE  DIAGNOSES: 

TOTAL  ANDMCLOLS  PULM  VENOUS  ORAINAGE  (12.72) 

PATENT  DUCTUS  ARTERIOSUS  (10.02) 

ENDOCARDIAL  CUSHION  DEFECT  (  9.32) 

P UI  MONAP Y  STENOSIS  (  6.72) 

COARCTATICN  WITH  A  VENT  SEPTAL  DEFECT  #  (  3.52) 

AORTIC  STENOSIS  *  {  2.42) 

TRANSPOSITION  OF  THE  GREAT  ARTERIES  (  1.42) 


PATI  ENT 


3 


CL  INICIAN:  OP.  MATISOFF 


DATE 


15/16/75 


SYMPTOMS  : 

GE  37  WEEKS  GESTATION 
Lt  3  CAYS  CLD 
FEMALE 

PULSE  PATE  LE  170 
NQcmal  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
LIVEP  LE  2  CM  BELOW  CM 
NO  DIFFERENTIAL  CYANOSIS 
RESPIRATORY  RATE  GT  70 
NO  CYANOSIS 
PH  GF  7. 30 
S  2  SPLIT 

53  ABSENT 

5 4  ABSENT 

EJECTION  CLICK  ABSENT 

NO  SYSTOLIC  MURMER  ' 

NO  DIASTOLIC  MURMER 
DP  INTERVAL  LE  0.11  ‘ 

OPS  INTERVAL  LE  0.06 
AXIS  60-180 

R/S  IN  VI  LF  10  AND  GE  0.8 

INCREASED  PULMONARY  BLOOD  FLOW  WITH  VENOUS  OBSTRUCTION 
MODERATELY  INCREASED  HEART  SIZE 
LEFT  AORTIC  ARCH 


CLINICAL  DIAGNOSIS: 

NON-CARDIAC  DISEASE 


PINAL  DIAGNOSIS: 

NON-CARD  I  AC  DISEASE 


PREDICTED  DIAGNOSIS 


PATENT  DUCTUS  ARTERIOSUS 


( 50.021 


OTHER  POSSIBLE  DIAGNOSES:  • 


VENTRICULAR  SEPTAL  DEFECT 

TOTAL  ANCMDLOLS  PULM  VENOUS  DRAINAGE 

PRIMARY  MYOCARDIAL  DISEASE 

TP.  ANS  PCS  IT  ION  OF  THE  GREAT  ARTERIES 

NON-CARDIAC  DISEASE 

CO  AF  CT  AT  ION  OF  THE  AORTA  / 


/ 


(21.92) 
(  12.02) 
(  8.72) 
(  3.62) 
(  1.52) 
(  1.32) 


PATIENT 


4 


CLINICIAN:  DP.  RESIDENT 

DATE:  10/23/75 

SYMPTOMS : 

GE  37  WEEKS  GESTATION 

L  E  3  CAYS  OLD 

MALE 

PULSE  RATE  LE  170 
NORMAL  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
LIVER  GT  2  CM  BELOW  CM 
NO  DIFFERENTIAL  CYANOSIS 
RESPIRATORY  RATE  GT  70 
CYANOSIS 

P02  INCREASE  WITH  100?  02  GE  30 
PH  L E  7.29 

5 2  SPLIT 

53  ABSENT 

54  ABSENT  /" 

EJECT  I CN  CLICK  ABSENT 

NO  SYSTOLIC  MURMER 
NO  DIASTOLIC  MURMER 
PR  INTERVAL  LE  0.11 
OR  5  INTERVAL  LE  0.06 
AXIS  60-180 

R/S  IN  VI  LE  10  AND  GE  0.8 
NORMAL  PULMONARY  BLOOD  FLOW 
NORMAL  HEART  SIZE 
LEFT  ACRTIC  ARCH 


CLINICAL  DIAGNOSIS: 

NON-CARDIAC  DISEASE 


FINAL  DIAGNOSIS: 

NON-CARDIAC  DISEASE' 


PREDICTED  DIAGNOSIS: 

NON-CARDIAC  DISEASE  (68.4?) 


OTHER  POSSIBLE  DIAGNOSES: 

PERSISTENT  FETAL  PATHWAYS  (24.4?) 

CQARCT  AT ICN  OP  THE  AORTA  (  4.1?) 

PRIMARY  MYOCARDIAL  OISEASE  '  (  1.1?) 

•  • 


. 

. 


PATIENT:  5 

CLINICIAN:  dr.  talner 
DATE:  11/25/75 

SY  mpt  CPS : 

GE  37  WEEKS  GESTATION 

GE  4  DAYS  OLD 

MALE 

PULSE  RATE  LE  170 
INCREASED  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
L IVEF  LE  2  CM  BELOW  CM 
NO  DIFFERENTIAL  CYANOSIS 
RESPIRATORY  RATE  GT  70 
CYANCS IS 

P02  INCREASE  WITH  100?  02  GE  30 
PH  GE  7.30 

52  SINGLE 

53  ABSENT 

5 4  ABSENT 

EJECTION  CLICK  ABSENT  f 

NO  DIASTOLIC  MURMER 

EJECTION  MUP.MER  AT  THE  LEFT  UPPER  STERNAL  BORDER 
pc  INTERVAL  LE  0.11 
QRS  INTERVAL  LE  0.06 
AXIS  6 0—1 80 

R /S  IN  VI  LE  10  AND  GE  0.  8 
T  IN  VI  LE  0  IN  PATIENT  AGE  GT  3  DAYS 

INCREASED  PULMONARY  BLOOD  FLOW  WITH  VENOUS  OBSTRUCTION 
MARKEDLY  INCREASED  HEART  SIZE 
LEFT  AORTIC  ARCH 


CLINICAL  DIAGNOSIS: 

PATENT  DUCTUS  ARTERIOSUS 

c  INAL  DIAGNOSIS: 

TRUNCIS  ARTERICSUS 


PREDICTED  DIAGNOSIS: 


VENTRICULAR  SEPTAL  DEFECT  (28.6?) 

OTHER  POSSIBLE  DIAGNOSES: 

PATENT  DUCTUS  ARTERIOSUS  "  (27.0?) 

TRUNCUS  ARTERIOSUS  (23.6?) 

TRANSPOSITION  OF  THE  GREAT  ARTERIES  (10.7?) 

COARCTATION  OF  THE  AORTA  (  6.6?) 

HYPOPLASTIC  LEFT  HEART  (  1.5?) 


PATIENT:  6 

CLINICIAN:  DR.  NIJDEL 

DATE:  12/24/75 

SYMPTOMS : 

GE  37  WEEKS  GESTATION 

LE  3  CAYS  OLD 

MALE 

PULSE  PATE  LE  170 
DECREASED  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
L  I V  EP  LE  2  C  M  BELOW  CM 
NO  DIFFERENTIAL  cyanosis 
RESPIRATORY  RATE  LE  70 

cyanosis 

PQ2  INCREASE  WITH  100*  02  LT  30 
S  2  SPLIT 

53  PRESENT 

54  ABSENT 

EJECTION  CLICK  ABSENT 
NO  DIASTOLIC  MURMER 

HOLOSYSTOLIC  MURDER  AT  THE  LEFT  LOWER  STERNAL  BCRDER 
EJECT  I C N  MURMER  AT  THE  LEFT  UPPER  STERNAL  BORDER 
OPS  INTERVAL  LE  0.06 
AXIS  6C-180 

F/S  IN  VI  LE  10  AND  GE  0.8 
NORMAL  PULMONARY  BLOOD  FLOW 
NORMAL  HEART  SIZE 
LEFT  ACRTIC  ARCH 


CLINICAL  DIAGNOSIS: 

PRIMARY  MYOCARDIAL  DISEASE 


FINAL  DIAGNOSIS: 

TRANSPOSITION  OF  THE  GREAT  ARTERIES 


PREDICTED  DIAGNOSIS: 


PERSISTENT  FETAL  PATHWAYS  (60.7*) 

OTHER  POSSIBLE  DIAGNOSES: 

TRANSPOSITION  OF  THE  GREAT  ARTERIES,  (19.0*) 

NCN-CARDIAC  DISEASE  (10.4*) 

HYPOPLASTIC  LEFT  HEART  (  2.5*) 

PRIMARY  MYOCARDIAL  DISE'ASE  ,  (  2.2*) 

PULMONARY  STENOSIS  (  1.8*) 

VENTRICULAR  SEPTAL  DEFECT  (  1.4*) 


PATIENT 


7 


CLINICIAN:  CR.  MATISCFF 

DATE:  12/21/75 

SYMPTOMS: 


LE  36  WEEKS  GESTATION 
GE  4  DAYS  OLD 
FEMALE 

PULSE  PATE  LE  170 
INCREASED  PULSF  INTENSITY 
NO  DIFFERENTIAL  PULSES 
L I VER  LE  2  CM  BFLCW  CM 
RESPIRATORY  RATE  LE  70 
NO  CYANOSIS 
PH  GF  7.30 

52  SPLIT 

53  ABSENT 

5 4  ABSENT 

EJECTION  CLICK  ABSENT  ,  . 

CCNTINOUS  MURMER  AT  THE  LEFT  LOWER  STERNAL  BORDER 
C  ONT INCUS  MURMER  AT  THE  LEFT  UPPER  STERNAL  BORDER 
PR  INTERVAL  GT  0.11 
OR  S  INTERVAL  LE  0.06 
AXIS  6C-180 

P /S  IN  VI  LE  10  AND  GE  0. 8 
NORMAL  PULMONARY  BLOOD  FLOW 
NORMAL  HEART  SIZE 
LEFT  AORTIC  ARCH 


CLINICAL  DIAGNOSIS: 

PATENT  DUCTUS  ARTERIOSUS 


FINAL  DIAGNOSIS: 

PATENT  DUCTUS  ARTERIOSUS 

PREDICTED  CIAGNOSIS: 

PATENT  DUCTUS  ARTERIOSUS  (98.8?) 

OTHER  POSSIBLE  DIAGNOSES: 


/ 

/ 


NONE 


* 


PATIENT 


8 


CL  INIC  IAN:  CP.  GLANZ 
DATE:  12/1C/75 

SYMPTOMS  : 

GE  37  WEEKS  GESTATION 

LE  3  CAYS  OLD 

■MALE 

PULSE  PATE  LE  170 
N  C  0  M  A  L  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
L  TVER  L  E  2  C v  BELOW  CM 
RESPTPATCPY  RATE  LE  70 
CYANOSIS 

PC 2  INCREASE  WITH  100?  02  GE  30 
PH  GE  7,30 

5 2  SINGLE 

53  ABSENT 

5 4  ABSENT 

EJECTION  CLICK  ABSENT 

NC  DIASTOLIC  MURMER  ' 

HOLOSYSTOLIC  NURMEP  AT  THE  LEFT  LOWER  STERNAL  BORDER 

EJECTION  MUR  ME  R  AT  THE  LEFT  UPPER  STERNAL  BORDER 

PR  INTERVAL  LE  0.11 

QRS  INTERVAL  GT  0.06 

AXIS  181-269 

2/S  IN  VI  GT  10 

INCREASED  PULMONARY  BLOOD  FLOW  WITH  VENOUS  OBSTRUCTION 
MODERATELY  INCREASED  HEART  SIZE 


CLINICAL  CIAGNOSIS: 

TRANSPOSITION  OF  THE  GREAT  ARTERIES 


FINAL  CIAGNOSIS: 

TRANSPOSITION  OF  THE  GREAT  ARTERIFS 


PDCDI CTED  CIAGNOSIS: 


TRANSPOSITION  OF  THE  GREAT  ARTERIES 


( 86.6?) 


OTHER  POSSIBLE  DIAGNOSES: 


VENTRICULAR  SEPTAL  DEFECT 
HYPOPLASTIC  LEFT  HEART 
TR UNCUS  ARTERIOSUS 


(  5.8?) 
(  4.0?) 
(  1.9?) 


PATIENT 


9 


CLINICIAN:  DR.  RESIDENT 

DATE:  11/17/75 

SYMPTOMS: 

Gc  37  WEEKS  GESTATION 
LE  3  DAYS  OLD 
MAL  F 

PUl SF  FATE  LE  170 
NORMAL  PULSE  INTENSITY 
OIRFEF  ENT  IAL  PULSES 
LI VFR  LE  2  CM  BELOW  CM 
NO  DIFFERENTIAL  CYANOSIS 
RESPIPATCDY  RATE  GT  70 
NO  CYANOSIS 
PH  GF  7.30 

52  SPLIT 

53  PRESENT 
S  4  ABSENT 

EJECTION  CLICK  ABSENT  <r  . ' 

NO  CIASTOLIC  MUEMER 

SYSTOLIC  MURMER  AT  THE  LEFT  LCWER  STERNAL  BORDER 
PR  INTERVAL  LE  0.11 
QRS  INTERVAL  LE  0.06 
AXIS  6  0-180 

R/S  IN  VI  LE  10  AND  GE  0.8 
T  IN  VI  GT  o  IN  PATIENT  AGE  GT  3  DAYS 
NORMAL  PULMONARY  BLOCQ  FLCW 
MODERATELY  INCREASED  HEART  SIZE  * 

LEFT  AORTIC  ARCH 


CLINICAL  DIAGNOSIS: 

COARCTATION  WITH  A  VENT  SEPTAL  DEFECT 

FINAL  DIAGNOSIS: 

COARCTATION  WITH  A  VENT  SEPTAL  DEFECT 

PREDICTED  DIAGNOSIS: 

COARCTATION  WITH  A  VENT  SEPTAL  DEFECT  (49.12) 


OT HEP  POSSIBLE  DIAGNOSES: 


COARCTATION  OF  THE  AORTA 
VENTRICULAR  SEPTAL  DEFECT 
AORTIC  STENOSIS 
PATENT  DUCTUS  ARTERIOSUS 


(24.32) 
(  15. 72) 
(  5.02) 
(  4.12) 


PAT  I  ENT 


10 


CLINICIAN:  DR.  RESIDENT 

DATE:  9/30/75  • 

SYMPTOMS : 


LE  36  kFEKS  GESTATION 
GE  4  CAYS  OLD 
FEMALE 

PULSE  PATE  LE  170 
NORMAL  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
LIVER  LE  2  CM  BELOW  CM 
RESPIRATORY  PATE  GT  70 
NO  CYANOSIS 
PH  GE  7. 30 
S  2  SPLIT 

53  ABSENT 

5 4  ABSENT 

EJECTICN  CLICK  ABSENT 
CONTINUOUS  MURMER  AT  THE  APEX 

C ONT  INDUS  MURMER  AT  THE  LEFT  LQWEF  STERNAL  BORDER 

CCNTINQUS  MURMER  AT  THE  LEFT  UPPER  STERNAL  BORDER 

PR  INTERVAL  LE  0.11 

OF S  INTERVAL  LE  0.06 

AXIS  6C-180 

P/S  IN  VI  LT  0.8 

T  IN  VI  GT  0  IN  PATIENT  AGE  GT  3  DAYS 

INCREASED  PULMONARY  BLOOD  FLOW  W/0  VENOUS  CBSTPUCTION 
NORMAL  HEART  SIZE 
LEFT  ACETIC  ARCH 


CLINICAL  DIAGNOSIS: 

PATENT  DUCTUS  ARTERIOSUS 


PINAL  DIAGNOSIS: 

PATENT  DUCTUS  ARTERIOSUS 

PREDICTED  DIAGNOSIS: 

PATENT  DUCTUS  APTERIOSUS  (97.0?) 

OTHER  POSSIBLE  DIAGNOSES: 


VENTRICULAR  SEPTAL  DEFECT 


(  1.5?) 


/ 


PATIENT 


II 


CLINICIAN:  DR.  NUDEL 

DATE:  12/28/75 

SYMPTOMS  : 

GE  37  WEEKS  GESTATION 
L  E  3  CAYS  OLD 
FEMALE 

PULSE  PATE  LE  170 
NORMAL  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
LIVER  LE  2  CM  BELCW  CM 
NO  DIFFERENTIAL  CYANOSIS 
RESPIRATORY  PATE  LE  70 
CYANOSIS 

PQ2  INCREASE  WITH  100?  02  LT  30 
PH  GE  7.30 

52  SPLIT 

53  ABSENT 

5 4  ABSENT 

EJECTION  CLICK  A8SENT 
MO  SYSTOLIC  MUPMER 
NO  DIASTOLIC  MURMER 
PR  INTERVAL  LE  0.11 
ORS  INTFPVAL  LE  0.06 
AXIS  181-269 

P./S  IN  VI  LF  10  AND  GE  0.8 
NOP MA l  PULMONARY  BLOOD  FLOW 
MODERATELY  INCREASED  HEART  SIZE 
LEFT  AORTIC  ARCH 


CLINICAL  DIAGNOSIS: 

NON— C  A®  D I  AC  DISEASE 


FINAL  DIAGNOSIS: 

NON-CARDIAC  DISEASE 


PREDICTED  DIAGNOSIS: 

TRANSPOSITION  OF  THE  GREAT  ARTERIES 


(35. 1?) 


OTHER  POSSIBLE  DIAGNOSES: 


NON-CARDIAC  DISEASE  (30.7?) 
PERSISTENT  FETAL  PATHWAYS  (16.9?) 
TOTAL  ANQMOLOLS  PLLM  VENOLS  DRAINAGE  ,  (11.5?) 
PATENT  DUCTUS  ARTERIOSUS  (  1.6?) 
VENTRICULAR  SEPTAL  DEFECT  (  1.1?) 


PATIENT:  12 

CLINICIAN:  OR. 

DATE:  00/00/00 

SYMPTOMS: 

L  E  36  WEEKS  GESTATION 
GE  4  DAYS  CLD 
FEMALE 

PULSE  PATE  LE  170 
NORMAL  PULSE  INTENSITY 
NO  nI  FFFRENT  IAL  PULSES 
LIVER  LF  2  CM  EELOW  CM 
NO  DIFFERENTIAL  CYANOSIS 
RESPIRATORY  RATE  LE  70 

cyanosis 

PH  GE  7.30 

52  SPLIT 

53  ABSENT 

5 4  ABS  ENT 

EJECTION  CLICK  ABSENT  ,  t 

NO  DIASTOLIC  MURMER 

HQLCSYSTOLIC  MURMER  -AT  THt  LEFT  LOWER  STERNAL  BORDER 
PR  INTERVAL  LE  C.ll 
QRS  INTERVAL  LE  0.06 
AXIS  6C-180 

P/S  IN  VI  LF  10  AND  GE  0.8 
T  IN  VI  LE  0  IN  PATIENT  AGE  GT  3  DAYS 

INCREASED  PULMONARY  BLOOD  FLOW  W/C  VENOUS  OBSTRUCTION 
MODERATELY  INCREASED  heart  size 
LEFT  ACPTIC  ARCH 


CLINICAL  DIAGNOSIS: 

VENTRICULAR  SEPTAL  DEFECT 

% 

FINAL  DIAGNOSIS: 

PATENT  DUCTUS  ARTERIOSUS 

PF  EDICTEO  DIAGNOSIS: 

PATENT  DUCTUS  ARTERIOSUS  (49.7?) 

OTHER  POSSIBLE  DIAGNOSES: 


VENTRICULAR  SEPTAL  DEFECT 


( 49.4?) 


APPENDIX  5 


Sample  Patients-Reanalyzed  with 
Incidence  Data  Omitted 


PAT  I  ENT 


2 


CLINICIAN:  DR.  GLANZ 

DATE:  10/26/75 

SYMPTOMS : 

GE  37  WEEKS  GESTATION 
GE  4  CAYS  OLD 
FEMALE 

PULSE  ° A TE  GT  170 
NORMAL  PULSE  INTENSITY 
MO  DIFFERENTIAL  PULSES 
L  I VEP  GT  2  CM  BELOW  CM 
RESPIRATORY  PATE  GT  70 
NO  CYANOSIS 
PH  GE  7.30 

5 2  SPLIT 

53  ABSENT 

5 4  ABSENT  ' 

EJECTICN  CLICK  PRESENT 

NC  DIASTOLIC  MURMER  * 

SYSTOLIC  MURMER  AT  THE  LEFT  LOWER  STERNAL  BORDER 
SYSTOLIC  MURMER  AT  THE  LEFT  UPPER  STERNAL  BORDER 
PD  INTERVAL  LE  0. 11 
QES  INTERVAL  LE  0.06 
AXIS  181-269 

P/S  IN  VI  LE  10  AND  GE  0.8 
T  IN  VI  GT  0  IN  PATIENT  AGE  GT  3  DAYS 

INCREASED  PULMONARY  BLOOD  FLOW  W/C  VENOUS  OBSTRUCTION 
MODERATELY  INCREASED  HEART  SIZE 
LEFT  AGPTIC  ARCH 


CLINICAL  DIAGNOSIS: 

EN COCARD  I &L  CUSHION  DEFECT 


FINAL  DIAGNOSIS: 

ENDOCARDIAL  CUSHION  DEFECT 


PREDICTED  DIAGNOSIS: 


ENDOCARDIAL  CUSHION  DEFECT  (29.6?) 

OTHFR  POSSIBLE  DIAGNOSES: 

VENTRICULAR  SEPTAL  DEFECT  (20.4?) 

TOTAL  ANCMOLGUS  PULM  VENOUS  DRAINAGE  (  14.9?) 

COARCTATION  WITH  A  VENT  SEPTAL  DEFECT  (11.2?) 

AO  RT IC  STENOSIS  '  (  7.7?) 

PULMONARY  STENOSIS  (  6.4?) 

TRICUSPID  ATRESIA  WITH  VSD  (  3.3?) 


PAT  I  ENT 


5 


CLINICIAN:  DR.  TALNER 

DATE:  11/25/75 

SYMPTOMS: 

G6  37  WEEKS  GESTATIGN 

GE  4  DAYS  OLD 

MALE 

'  PULSE  RATE  LE  170 

INCREASED  PULSE  INTENSITY 
NO  DIFFERENTIAL  PULSES 
LIVER  LE  2  CM  BELOW  CM 
NO  DIFFERENTIAL  CYANOSIS  .  . 

RESPIRATORY  RATE  GT  70 
CYANOS  IS 

P02  INCREASE  WITH  1003  02  GE  30 
-  PH  GE  7. 30 

52  S  INGLE 

53  ABSENT 

54  ABSENT 

EJECTION  CLICK  ABSENT 
NO  DIASTOLIC  MURMER 

EJECTION  MURMER  AT  THE  LEFT  UPPER  STERNAL  BORDER 
PR  INTERVAL  LE  0.11 
QRS  INTERVAL  LE  0.  06- 
AX  IS  60-180 

P./S  IN  VI  LE  10  AND  GE  0.8 
T  IN  Vi  LE  0  IN  PATIENT  AGE  GT  3  DAYS 

INCREASED  PULMONARY  BLOOD  FLOW  WITH  VENOUS  OBSTRUCTION 
MARKECLY  INCREASED  HEART  SIZE 
LEFT  AORTIC  ARCH 


CLINICAL  DIAGNOSIS: 

PATENT  DUCTUS  ARTERIOSUS 

=INAL  DIAGNOSIS: 

TRUNCUS  ARTERIOSUS 

PREDICTED  DIAGNOSIS: 

TRUNCUS  ARTERIOSUS 


( 77.83) 


OTHER  POSSIBLE  DIAGNOSES: 


VENTRICULAR  SEPTAL  DEFECT  (  7.53) 
PATENT  DUCTUS  ARTERIOSUS  (  .5.53) 
COARCTATION  OF  THE  AORTA  (  3.43) 
TRANSPOSITION  OF.  THE  GREAT  ARTERIES  •'  (  1.73) 
COARCTATIHN  WITH  A  VENT  SEPTAL  DEFECT  (  1.13) 


le 
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